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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Choi et. al . 



FEB 2 0 2003 
TECH CENTER 1600/2900 




(ii) TITLE OF INVENTION: Streptococcus pneumoniae Antigens and 
Vaccines 

(iii) NUMBER OF SEQUENCES: 454 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Human Genome Sciences, Inc. 

(B) STREET: 9410 Key West Avenue 

(C) CITY: Rockville 
{ D ) STATE : Maryland 

(E) COUNTRY: USA 

(F) ZIP: 20850 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette, 3.50 inch, 1.4Mb storage 

(B) COMPUTER: Dell Latitude C610 

(C) OPERATING SYSTEM: Windows 2000 

(D) SOFTWARE: ASCII Text 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 09/765 , 272 

(B) FILING DATE: January 22, 2001 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/961,083 

(B) FILING DATE: OCT-30-1997 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Lin J. Hymel 

(B) REGISTRATION NUMBER: 45,414 

(C) REFERENCE /DOCKET NUMBER: PB340P2C2 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (301) 610-5790 

(B) TELEFAX: (301) 309-8439 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1999 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 



T AAAATC TAG 


GACAATAAAA 


TV rp/^ TV 7\ r^rn/*^ A T" 


rp/^/trp/^ TV /^mrn/-i 
1 tjU 1 VjAU i i 


/^pmrnpm/^ a a 




fin 


TGCCCAAGCT 


AATGATATTC 


O 7V TV TV fpinrii 




AiL-Vj i i iL. i A 


1 L. (jj AAVjAL. V- A 


1 90 

JL Z u 


TCGCTTCTTC 


GACCACAGGG 


GGATTGATAC 


CATCCGTATL 


L i VjjVjCjA^jjL. i i 


rpornrpooop A A 


1 fin 

± 0 VJ 


TCTGCAAAGC 


AATTCCCTCC 


AAGGTGGATC 


AAC TC TC AC C 


/"< A A A /^rnm/^ a 
L, AAC Atj i 1 VjA 


mm A AOmmOAO 
i 1 AAtj J. i Li At- 


9 A n 

Z ft u 


TTACTTTTCA 


ACTTCGACTT 


CCGACCAGAC 


TATTTCTCGT 


AAGGC TC AGG 


A A oommoomm 
AAUL i 1 1 i 


"3 n n 
0 u u 


AGCGATTCAG 


TTAGAACAAA 


AAGCAACCAA 


^/^A A^A A Arn/~t 

GCAAGAAATC 


TTGAL C i AL i 


AmAmA A AmA a 
Ai Al AAA! AA 


"3 n 
0 D u 


GGTCTACATG 


TCTAATGGGA 


TV m TV rT^ TV TV m 

AC TATGGAAT 


A A /"^ A A 

GCAGACAGCA 


GCTCAAAAL i 


A om A moom A a 
AL 1 A J. \3\3 i AA 


A9 n 

ftZ u 


AGACCTCAAT 


AATTTAAGTT 


mTv m TV mm 

TACCTCAGTT 


AGCCTTGCTG 


GC TGGAATGL 


omo A r*f^r* a 00 


/ton 
ft c5 u 


AAACCAATAT 


GACCCCTATT 


CACATCCAGA 


AGCAGCCCAA 


G AC L CjL- L. vj AA 


A ommoPT'On^T' 
AL. 1 i LjLf lull 


D ft u 


ATCTGAAATG 


TV TV TV Tl T\ m/^ TV TV 

AAAAATCAAG 


GCTACATCTC 


TGCTGAACAG 


i A i ACj AAALj 


P APT'P a ST* Tip 

ALr i L. AA 1 AU 


n n 
D u u 


ACCAATTACT 


GATGGACTAC 


TV TV TV /^mOm/^ TV TV 

AAAGTCTCAA 


A rr\r^ A 00 A A /^T> 

ATL ACjL AAb i 


AAi iAUL.L.iVj 


p T"T' a p a Tip P 2i 
L- 1 I AL- A i LiLjA 


. D D U 


TAATTACCTC 


TV TV TV TV m/~1 TV 

AAGGAAGTCA 


m/^ TV TV m/^ TV TV /*i m 

TCAATCAAGT 


m/^ A AOA A^A A 

TGAAGAAGAA 


A OA ooonn A rp A 
AC AGLjC i a i a 


A oorpA omo A p 

AL.L. i AL- i L. AL- 


7 9 0 
/ Z U 


AACTGGGATG 


TV m/^m/^ m tv tv 

GATGTCTACA 


y~» Ti TV Ti rn/~i m tv TV 

CAAATGTAGA 


C C AAG AAGC T 


OAA A AAOA rpO 

L AAAAAL-A 1 L 


morpooo A mTvm 

1 Lj 1 LfLtLtA i a i 


7 R 0 


TTACAATACA 


GACGAATACG 


TTGCCTATCC 


A A A mo A A 

AG AC G ATGAA 


TTGC AAG J. L b 


ommom A 00 a t* 
U i IL. 1 AL-LAi 


Q A 0 
Oft U 


TGTTGATGTT 


TCTAACGGTA 


TV TV m TV mm/^ 

AAGTCATTGC 


CCAGCTAGGA 


OOAOOOOA rpO 

GCACGCCATC 


A omo A A om A A 
AGICAALjI AA 


Q n n 
y u u 


TGTTTCCTTC 


GGAATTAACC 


TV TV TV m TV TV 

AAGCAGTAGA 


AAC AAACC GC 


0 A 0 mo A m 

GACTGGGGAT 


OA A 0 m A mo A A 
C AAC 1 A J. LiAA 


Q c n 
y 0 u 


ACCGATCACA 


GACTATGCTC 


CTGCCTTGGA 


GTACGGTGTC 


m A A mmO A A 

TAC GATTC AA 


omoom A om A m 
CTGCTAC 1 A 1 


1 n 0 n 
lUz u 


CGTTCACGAT 


GAGCCCTATA 


ACTACCCTGG 


TV TV TV TV m TV m 

GACAAATACT 


CCTGTTTATA 


A omooo A m a o 

ACTGGGATAG 


1 n Q ft 


GGGCTACTTT 


GGCAACATCA 


mr^ TV TV m tv 

CCTTGCAATA 


CGCCCTGCAA 


0 A A mOOOO A A 

CAATCGCGAA 


A oornooo A 00 
ACLj i LCLALiU 


1 1 /I n 


CGTGGAAACT 


CTAAACAAGG 


y^ TV /^m/^ TV TV 

TCGGACTCAA 


CCGCGCCAAG 


A ommmoomA a 

ACTTTCCTAA 


A moomomA 00 
AlLiLii L IALiLj 


1 9 nn 
±z uu 


AATCGACTAC 


CCAAGTATTC 


TV ^^rrTTv /^m^^ TV TV TV 

ACTACTCAAA 


TGCCATTTCA 


A Om A A 0 A 0 A A 

AGTAACACAA 


000 A A mo A 0 A 

L-L.LrAA 1 L-ALjA 


T 0 c n 

±Z DU 


CAAAAAATAT 


TV TV TV m TV 

GGAGCAAGTA 


/^m/^ Tl TV TV TV TV fTI 

GTGAAAAGAT 


GGC TGC TGC T 


TACGCiVjLU i 


rpmoo AAA T^PP 

1 1 LrL, AAA i LjLj 


17 9 0 
i J Z u 


TGGAACTTAC 


TATAAACCAA 


TGTATATC C A 


m A A AOmo/^mo 

TAAAGTC GTC 


mm rp a omo a mo 
TTTAG i oA 1 (j 


LtLi ALr i LiAAAA 


1 7R0 
J- J 0 u 


AGAGTTCTCT 


AATGTCGGAA 


CTCGTGCCAT 


0 A A A A A 


AO AOOOmATiA 

AL ACjL-C i a 1 a 


TPaTipapppa 
i LtA i LjAL. L-LiA 


1 AAC\ 
J-ftft U 


CATGATGAAA 


ACAGTCTTGA 


CTTATGGAAC 


TGGACGAAAT 


GCCTATCTTG 


CTTGGCTCCC 


1500 


TCAGGCTGGT 


AAAACAGGAA 


CCTCTAACTA 


TACAGACGAG 


GAAATTGAAA 


ACCACATCAA 


1560 


GACCTCTCAA 


TTTGTAGCAC 


CTGATGAACT 


ATTTGCTGGC 


TATACGCGTA 


AATATTCAAT 


1620 


GGCTGTATGG 


ACAGGCTATT 


CTAACCGTCT 


GACACCACTT 


GTAGGCAATG 


GCCTTACGGT 


1680 


CGCTGCCAAA 


GTTTACCGCT 


CTATGATGAC 


CTACCTGTCT 


GAAGGAAGCA 


ATCCAGAAGA 


1740 


TTGGAATATA 


CCAGAGGGGC 


TCTACAGAAA 


TGGAGAATTC 


GTATTTAAAA 


ATGGTGCTCG 


1800 



i 
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TTCTACGTGG AACTCACCTG CTCCACAACA ACCCCCATCA ACTGAAAGTT CAAGCTCATC 1860 

ATCAGATAGT TCAACTTCAC AGTCTAGCTC AACCACTCCA AGCACAAATA ATAGTACGAC 1920 

TACCAATCCT AACAATAATA CGCAACAATC AAATACAACC CCTGATCAAC AAAATCAGAA 1980 

TCCTCAACCA GCACAACCA 1999 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 666 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 

Lys lie Tyr Asp Asn Lys Asn Gin Leu lie Ala Asp Leu Gly Ser Glu 
1 5 10 15 

Arg Arg Val Asn Ala Gin Ala Asn Asp lie Pro Thr Asp Leu Val Lys 
20 25 30 

Ala lie Val Ser lie Glu Asp His Arg Phe Phe Asp His Arg Gly lie 
35 40 45 

Asp Thr lie Arg lie Leu Gly Ala Phe Leu Arg Asn Leu Gin Ser Asn 
50 55 60 

Ser Leu Gin Gly Gly Ser Thr Leu Thr Gin Gin Leu lie Lys Leu Thr 
65 70 75 80 

Tyr Phe Ser Thr Ser Thr Ser Asp Gin Thr lie Ser Arg Lys Ala Gin 
85 90 95 

Glu Ala Trp Leu Ala lie Gin Leu Glu Gin Lys Ala Thr Lys Gin Glu 
100 105 110 

lie Leu Thr Tyr Tyr lie Asn Lys Val Tyr Met Ser Asn Gly Asn Tyr 
115 120 125 

Gly Met Gin Thr Ala Ala Gin Asn Tyr Tyr Gly Lys Asp Leu Asn Asn 
130 135 140 

Leu Ser Leu Pro Gin Leu Ala Leu Leu Ala Gly Met Pro Gin Ala Pro 
145 150 155 160 

Asn Gin Tyr Asp Pro Tyr Ser His Pro Glu Ala Ala Gin Asp Arg Arg 
165 170 175 

Asn Leu Val Leu Ser Glu Met Lys Asn Gin Gly Tyr lie Ser Ala Glu 
180 185 190 

Gin Tyr Glu Lys Ala Val Asn Thr Pro lie Thr Asp Gly Leu Gin Ser 
195 200 205 

Leu Lys Ser Ala Ser Asn Tyr Pro Ala Tyr Met Asp Asn Tyr Leu Lys 
210 215 220 

Glu Val He Asn Gin Val Glu Glu Glu Thr Gly Tyr Asn Leu Leu Thr 



225 230 ' 235 240 

Thr Gly Met Asp Val Tyr Thr Asn Val Asp Gin Glu Ala Gin Lys His 
245 250 255 

Leu Trp Asp lie Tyr Asn Thr Asp Glu Tyr Val Ala Tyr Pro Asp Asp 
260 265 270 

Glu Leu Gin Val Ala Ser Thr lie Val Asp Val Ser Asn Gly Lys Val 
275 280 285 

lie Ala Gin Leu Gly Ala Arg His Gin Ser Ser Asn Val Ser Phe Gly 
290 295 300 

lie Asn Gin Ala Val Glu Thr Asn Arg Asp Trp Gly Ser Thr Met Lys 
305 310 315 320 

Pro lie Thr Asp Tyr Ala Pro Ala Leu Glu Tyr Gly Val Tyr Asp Ser 
325 330 335 

Thr Ala Thr lie Val His Asp Glu Pro Tyr Asn Tyr Pro Gly Thr Asn 
340 345 350 

Thr Pro Val Tyr Asn Trp Asp Arg Gly Tyr Phe Gly Asn lie Thr Leu 
355 360 365 

Gin Tyr Ala Leu Gin Gin Ser Arg Asn Val Pro Ala Val Glu Thr Leu 
370 375 380 

Asn Lys Val Gly Leu Asn Arg Ala Lys Thr Phe Leu Asn Gly Leu Gly 
385 390 395 400 

lie Asp Tyr Pro Ser lie His Tyr Ser Asn Ala lie Ser Ser Asn Thr 
405 410 415 

Thr Glu Ser Asp Lys Lys Tyr Gly Ala Ser Ser Glu Lys Met Ala Ala 
420 425 430 

Ala Tyr Ala Ala Phe Ala Asn Gly Gly Thr Tyr Tyr Lys Pro Met Tyr 
435 440 445 

lie His Lys Val Val Phe Ser Asp Gly Ser Glu Lys Glu Phe Ser Asn 
450 455 460 

Val Gly Thr Arg Ala Met Lys Glu Thr Thr Ala Tyr Met Met Thr Asp 
465 470 475 480 

Met Met Lys Thr Val Leu Thr Tyr Gly Thr Gly Arg Asn Ala Tyr Leu 
485 490 495 

Ala Trp Leu Pro Gin Ala Gly Lys Thr Gly Thr Ser Asn Tyr Thr Asp 
500 505 510 

Glu Glu lie Glu Asn His lie Lys Thr Ser Gin Phe Val Ala Pro Asp 
515 520 525 

Glu Leu Phe Ala Gly Tyr Thr Arg Lys Tyr Ser Met Ala Val Trp Thr 
530 535 540 

Gly Tyr Ser Asn Arg Leu Thr Pro Leu Val Gly Asn Gly Leu Thr Val 
545 550 555 560 

Ala Ala Lys Val Tyr Arg Ser Met Met Thr Tyr Leu Ser Glu Gly Ser 
565 570 575 
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Asn Pro Glu Asp Trp Asn lie Pro Glu Gly Leu Tyr Arg Asn Gly Glu 
580 585 590 

Phe Val Phe Lys Asn Gly Ala Arg Ser Thr Trp Asn Ser Pro Ala Pro 
595 600 605 

Gin Gin Pro Pro Ser Thr Glu Ser Ser Ser Ser Ser Ser Asp Ser Ser 
610 615 620 

Thr Ser Gin Ser Ser Ser Thr Thr Pro Ser Thr Asn Asn Ser Thr Thr 
625 630 635 640 

Thr Asn Pro Asn Asn Asn Thr Gin Gin Ser Asn Thr Thr Pro Asp Gin 
645 650 655 

Gin Asn Gin Asn Pro Gin Pro Ala Gin Pro 
660 665 

(2) INFORMATION FOR SEQ ID NO: 3: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1714 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



AAATTACAAT 


ACGGACTATG 


AATTGACCTC 


TGGAGAAAAA 


TTACCTCTTC 


CTAAAGAGAT 


60 


TTCAGGTTAC 


ACTTATATTG 


GATATATCAA 


AGAGGGAAAA 


ACGACTTCTG 


AGTCTGAAGT 


120 


AAGTAATCAA 


AAGAGTTCAG 


TTGCCACTCC 


TACAAAACAA 


CAAAAGGTGG 


ATTATAATGT 


180 


TACACCGAAT 


TTTGTAGACC 


ATCCATCAAC 


AGTACAAGCT 


ATTCAGGAAC 


AAACACCTGT 


240 


TTCTTCAACT 


AAGCCGACAG 


AAGTTCAAGT 


AGTTGAAAAA 


CCTTTCTCTA 


CTGAATTAAT 


300 


CAATCCAAGA 


AAAGAAGAGA 


AACAATCTTC 


AGATTCTCAA 


GAACAATTAG 


CCGAACATAA 


360 


GAATCTAGAA 


ACGAAGAAAG 


AGGAGAAGAT 


TTCTCCAAAA 


GAAAAGACTG 


GGGTAAATAC 


420 


ATTAAATCCA 


CAGGATGAAG 


TTTTATCAGG 


TCAATTGAAC 


AAACCTGAAC 


TCTTATATCG 


480 


TGAGGAAACT 


ATGGAGACAA 


AAATAGATTT 


TCAAGAAGAA 


ATTCAAGAAA 


ATCCTGATTT 


540 


AGCTGAAGGA 


ACTGTAAGAG 


TAAAACAAGA 


AGGTAAATTA 


GGTAAGAAAG 


TTGAAATCGT 


600 


CAGAATATTC 


TCTGTAAACA 


AGGAAGAAGT 


TTCGCGAGAA 


ATTGTTTCAA 


CTTCAACGAC 


660 


TGCGCCTAGT 


CCAAGAATAG 


TCGAAAAAGG 


TACTAAAAAA 


ACTCAAGTTA 


TAAAGGAACA 


720 


ACCTGAGACT 


GGTGTAGAAC 


ATAAGGACGT 


ACAGTCTGGA 


GCTATTGTTG 


AACCCGCAAT 


780 


TCAGCCTGAG 


TTGCCCGAAG 


CTGTAGTAAG 


TGACAAAGGC 


GAACCAGAAG 


TTCAACCTAC 


840 


ATTACCCGAA 


GCAGTTGTGA 


CCGACAAAGG 


TGAGACTGAG 


GTTCAACCAG 


AGTCGCCAGA 


900 


TACTGTGGTA 


AGTGATAAAG 


GTGAACCAGA 


GCAGGTAGCA 


CCGCTTCCAG 


AATATAAGGG 


960 


TAATATTGAG 


CAAGTAAAAC 


CTGAAACTCC 


GGTTGAGAAG 


ACCAAAGAAC 


AAGGTCCAGA 


1020 
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AAAAACTGAA GAAGTTCCAG TAAAACCAAC AGAAGAAACA CCAGTAAATC CAAATGAAGG 1080 

TACTACAGAA GGAACCTCAA TTCAAGAAGC AGAAAATCCA GTTCAACCTG CAGAAGAATC 1140 

AACAACGAAT TCAGAGAAAG TATCACCAGA TACATCTAGC AAAAATACTG GGGAAGTGTC 1200 

CAGTAATCCT AGTGATTCGA CAACCTCAGT TGGAGAATCA AATAAACCAG AACATAATGA 1260 

CTCTAAAAAT GAAAATTCAG AAAAAACTGT AGAAGAAGTT CCAGTAAATC CAAATGAAGG 1320 

CACAGTAGAA GGTACCTCAA ATCAAGAAAC AGAAAAACCA GTTCAACCTG CAGAAGAAAC 1380 

ACAAACAAAC TCTGGGAAAA TAGCTAACGA AAATACTGGA GAAGTATCCA ATAAACCTAG 1440 

TGATTCAAAA CCACCAGTTG AAGAATCAAA TCAACCAGAA AAAAACGGAA CTGCAACAAA 1500 

ACCAGAAAAT TCAGGTAATA CAACATCAGA GAATGGACAA ACAGAACCAG AACCATCAAA 1560 

CGGAAATTCA ACTGAGGATG TTTCAACCGA ATCAAACACA TCCAATTCAA ATGGAAACGA 162 0 

AGAAATTAAA CAAGAAAATG AACTAGACCC TGATAAAAAG GTAGAAGAAC CAGAGAAAAC 1680 

ACTTGAATTA AGAAATGTTT CCGACCTAGA GTTA 1714 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 571 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Asn Tyr Asn Thr Asp Tyr Glu Leu Thr Ser Gly Glu Lys Leu Pro Leu 
1 5 10 15 

Pro Lys Glu lie Ser Gly Tyr Thr Tyr lie Gly Tyr lie Lys Glu Gly 
20 25 30 

Lys Thr Thr Ser Glu Ser Glu Val Ser Asn Gin Lys Ser Ser Val Ala 
35 40 45 

Thr Pro Thr Lys Gin Gin Lys Val Asp Tyr Asn Val Thr Pro Asn Phe 
50 55 60 

Val Asp His Pro Ser Thr Val Gin Ala lie Gin Glu Gin Thr Pro Val 
65 70 75 80 

Ser Ser Thr Lys Pro Thr Glu Val Gin Val Val Glu Lys Pro Phe Ser 
85 90 95 

Thr Glu Leu lie Asn Pro Arg Lys Glu Glu Lys Gin Ser Ser Asp Ser 
100 105 110 

Gin Glu Gin Leu Ala Glu His Lys Asn Leu Glu Thr Lys Lys Glu Glu 
115 120 125 

Lys lie Ser Pro Lys Glu Lys Thr Gly Val Asn Thr Leu Asn Pro Gin 
130 135 140 



Asp Glu Val Leu Ser Gly Gin Leu Asn Lys Pro Glu Leu Leu Tyr Arg 
145 150 155 160 

Glu Glu Thr Met Glu Thr Lys lie Asp Phe Gin Glu Glu lie Gin Glu 
165 170 175 

Asn Pro Asp Leu Ala Glu Gly Thr Val Arg Val Lys Gin Glu Gly Lys 
180 185 190 

Leu Gly Lys Lys Val Glu lie Val Arg lie Phe Ser Val Asn Lys Glu 
195 200 205 

Glu Val Ser Arg Glu lie Val Ser Thr Ser Thr Thr Ala Pro Ser Pro 
210 215 220 . 

Arg lie Val Glu Lys Gly Thr Lys Lys Thr Gin Val lie Lys Glu Gin 
225 230 235 240 

Pro Glu Thr Gly Val Glu His Lys Asp Val Gin Ser Gly Ala lie Val 
245 250 255 

Glu Pro Ala lie Gin Pro Glu Leu Pro Glu Ala Val Val Ser Asp Lys 
260 265 270 

Gly Glu Pro Glu Val Gin Pro Thr Leu Pro Glu Ala Val Val Thr Asp 
275 280 285 

Lys Gly Glu Thr Glu Val Gin Pro Glu Ser Pro Asp Thr Val Val Ser 
290 295 300 

Asp Lys Gly Glu Pro Glu Gin Val Ala Pro Leu Pro Glu Tyr Lys Gly 
305 310 315 320 

Asn lie Glu Gin Val Lys Pro Glu Thr Pro Val Glu Lys Thr Lys Glu 
325 330 335 

Gin Gly Pro Glu Lys Thr Glu Glu Val Pro Val Lys Pro Thr Glu Glu 
340 345 350 

Thr Pro Val Asn Pro Asn Glu Gly Thr Thr Glu Gly Thr Ser lie Gin 
355 360 365 

Glu Ala Glu Asn Pro Val Gin Pro Ala Glu Glu Ser Thr Thr Asn Ser 
370 375 380 

Glu Lys Val Ser Pro Asp Thr Ser Ser Lys Asn Thr Gly Glu Val Ser 
385 390 395 400 

Ser Asn Pro Ser Asp Ser Thr Thr Ser Val Gly Glu Ser Asn Lys Pro 
405 410 415 

Glu His Asn Asp Ser Lys Asn Glu Asn Ser Glu Lys Thr Val Glu Glu 
420 425 430 

Val Pro Val Asn Pro Asn Glu Gly Thr Val Glu Gly Thr Ser Asn Gin 
435 440 445 

Glu Thr Glu Lys Pro Val Gin Pro Ala Glu Glu Thr Gin Thr Asn Ser 
450 455 460 

Gly Lys lie Ala Asn Glu Asn Thr Gly Glu Val Ser Asn Lys Pro Ser 
465 470 475 480 
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Asp Ser Lys Pro Pro Val Glu Glu Ser Asn Gin Pro Glu Lys Asn Gly 
485 490 495 

Thr Ala Thr Lys Pro Glu Asn Ser Gly Asn Thr Thr Ser Glu Asn Gly 
500 505 510 

Gin Thr Glu Pro Glu Pro Ser Asn Gly Asn Ser Thr Glu Asp Val Ser 
515 520 525 

Thr Glu Ser Asn Thr Ser Asn Ser Asn Gly Asn Glu Glu lie Lys Gin 
530 535 540 

Glu Asn Glu Leu Asp Pro Asp Lys Lys Val Glu Glu Pro Glu Lys Thr 
545 550 555 560 

Leu Glu Leu, Arg Asn Val Ser Asp Leu Glu Leu 
565 570 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 748 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

TGAGAATCAA GCTACACCCA AAGAGACTAG CGCTCAAAAG ACAATCGTCC TTGCTACAGC 60 

TGGCGACGTG CCACCATTTG ACTACGAAGA CAAGGGCAAT CTGACAGGCT TTGATATCGA 120 

AGTTTTAAAG GCAGTAGATG AAAAACTCAG CGACTACGAG ATTCAATTCC AAAGAACCGC 180 

CTGGGAGAGC ATCTTCCCAG GACTTGATTC TGGTCACTAT CAGGCTGCGG CCAATAACTT 240 

GAGTTACACA AAAGAGCGTG CTGAAAAATA CCTTTACTCG CTTCCAATTT CCAACAATCC 300 

CCTCGTCCTT GTCAGCAACA AGAAAAATCC TTTGACTTCT CTTGACCAGA TCGCTGGTAA 360 

AACAACACAA GAGGATACCG GAACTTCTAA CGCTCAATTC ATCAATAACT GGAATCAGAA 420 

ACACACTGAT AATCCCGCTA CAATTAATTT TTCTGGTGAG GATATTGGTA AACGAATCCT 480 

AGACCTTGCT AACGGAGAGT TTGATTTCCT AGTTTTTGAC AAGGTATCCG TTCAAAAGAT 540 

TATCAAGGAC CGTGGTTTAG ACCTCTCAGT CGTTGATTTA CCTTCTGCAG ATAGCCCCAG 600 

CAATTATATC ATTTTCTCAA GCGACCAAAA AGAGTTTAAA GAGCAATTTG ATAAAGCGCT 660 

CAAAGAACTC TATCAAGACG GAACCCTTGA AAAACTCAGC AATACCTATC TAGGTGGTTC 720 

TTACCTCCCA GATCAATCTC AGTTACAA 748 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 249 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Glu Asn Gin Ala Thr Pro Lys Glu Thr Ser Ala Gin Lys Thr lie Val 
1 5 10 15 

Leu Ala Thr Ala Gly Asp Val Pro Pro Phe Asp Tyr Glu Asp Lys Gly 
20 25 30 

Asn Leu Thr Gly Phe Asp lie Glu Val Leu Lys Ala Val Asp Glu Lys 
35 40 45 

Leu Ser Asp Tyr Glu lie Gin Phe Gin Arg Thr Ala Trp Glu Ser lie 
50 55 60 

Phe Pro Gly Leu Asp Ser Gly His Tyr Gin Ala Ala Ala Asn Asn Leu 
65 70 75 80 

Ser Tyr Thr Lys Glu Arg Ala Glu Lys Tyr Leu Tyr Ser Leu Pro lie 
85 90 95 

Ser Asn Asn Pro Leu Val Leu Val Ser Asn Lys Lys Asn Pro Leu Thr 
100 105 110 

Ser Leu Asp Gin lie Ala Gly Lys Thr Thr Gin Glu Asp Thr Gly Thr 
115 120 125 

Ser Asn Ala Gin Phe lie Asn Asn Trp Asn Gin Lys His Thr Asp Asn 
130 135 140 

Pro Ala Thr lie Asn Phe Ser Gly Glu Asp lie Gly Lys Arg lie Leu 
145 150 155 160 

Asp Leu Ala Asn Gly Glu Phe Asp Phe Leu Val Phe Asp Lys Val Ser 
165 170 175 

Val Gin Lys lie lie Lys Asp Arg Gly Leu Asp Leu Ser Val Val Asp 
180 185 190 

Leu Pro Ser Ala Asp Ser Pro Ser Asn Tyr lie lie Phe Ser Ser Asp 
195 200 205 

Gin Lys Glu Phe Lys Glu Gin Phe Asp Lys Ala Leu Lys Glu Leu Tyr 
210 215 220 

Gin Asp Gly Thr Leu Glu Lys Leu Ser Asn Thr Tyr Leu Gly Gly Ser 
225 230 235 240 

Tyr Leu Pro Asp Gin Ser Gin Leu Gin 
245 

INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 985 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
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TGGTAACCGC 


TCTTCTCGTA 




A rnorpmorpo a rn 
A i L. J. i U A J. 


r^fvr* A A A r* A A 
Kd 1 LxAAtjjAL AA 


A A r^r* A r^r* s a t" 
AAtaC ALjL, AA i 


fin 


/^/^m/^ TV /~*m^ TV m 

CGTCACTGAT 


ACTGGTGGTG 




AiL-Ai IL.AAL. 


L,AA i L. AvjL i i 


LrtjLiAALiVj J. 1 1 


JL ^ U 


GCAGGCTTGG 


/^r^rPTV A TVOA 

GGTAAAGAAC 


A A A rpr^rprnrn/^ 

ACAAiL. i i i L 


AA AAOAT^AAr^ 

AAAAVj A i AAC 




TV ornmoo A ATT^ 

AC i i CL.AA 1 L. 


X o u 


AA.CAAGTGAA 


/~»/^m/^TV/~*rnTV/^o 

GCTGACTACG 


C i AAL AAL 1 1 


or* A A r' A A f^r*r^ 
Lj L AAL. AAUL. U 


oomo/^ A A r^mm 


Ar" A arr'TTi at* 
AC AAUU 1 AA 1 


9 ZL n 

Z fi U 


CTTCGGTGTT 


GGTTTTGCCC 


mm A A m A A rn/~*f~^ 

TTAATAATGL 


A o mm A A A A m 
AGi iAAAUAl 


OOAOOAAAAO 

GLAGLAAAAG 


A A o A A mr^ a 
AAC AC AC i CA 


inn 
^ u u 


CTTGAACTAT 


GTCTTGATTG 


ATGATGTGAT 


mA A AOAOOA A 

r AAAGAC C AA 


A AOA Amommo 
AAGAAlGi IG 


OO A or^r^m A a 
C GACC C i AAC 


1 c n 
J D u 


TTTCGCTGAT 


TV TV m/^ TV /**im/*i TV 

AATGAGTCAG 


GTTACCTTGC 


A oomomooom 

AGGTGTGGLT 


OO A OO AAAAA 

GLAGC AAAAA 


OAA/^mAAOAO 

C AAC 1 AAC AC 


/ion 


AAAACAAGTT 


GGTTTTGTAG 


GTGGTATC GA 


TV momo 71 TV /^mm 

ATCTGAAGTT 


ATCTCTCGTT 


mmo A A OO A OO 
T 1 GAAGC AGG 


/ion 


ATTCAAGGCT 


GGTGTTGCGT 


/-I 7\ ^mTV O A /""/^O 

C AGT AG AC C C 


A mom A mo a a a 
ATLl ATL AAA 


omoo A Aommo 
GiCCAAGi IG 


A r^m A r^r^r^T^r^r^ 
AC i ACCC i CC 




TTCATTTGGT 


GATGCGGCTA 


A TV /~«/^mA AAA 

AAGGTAAAAC 


A AmmooAOOO 

AATTGCAGLC 


OO A o A A m A OO 

GC AC AA 1 ACG 


o A ooooonnoo 
CAGCCGG i GC 


n n 
o u u 


AGATATTGTT 


m TV /^^^TV TV ^^mTV 

TAG C AAGT AG 


C TGGTGGT AC 


A oomooAOom 

AGGTGCAGGT 


Omommmoo a o 

GTC TTTGC AG 


A r^r^r^ a a a a mo 
AGGC AAAA i C 


D D U 


m/^rT^^*^TV TV /^^*^Ti TV 

TCTCAACGAA 


AGCCGTCCTG 


A A A A mo A A A TV 

AAAATGAAAA 


AG Hi GGG i 1 


A moor^m/^mm/^ 
Al CGGiGl iG 


TV rpo orno A O O A 

AiCCiCACCA 


7 9 n 


TV TV TV TV TV 7k 

AGAAGCAGAA 


/^/^rm\ "A TV mTV o tv 

GGTAAATACA 


ommom A a a o a 
L r i L 1 AAACjA 


moor^ A A A A A 
1 GGC AAAGAA 


■ m/^ AAA /^mTim/^ 
i C AAAC i 1 ILr 


rnrporpmorp A mo 


7 Rn 


TACTTTGAAA 


CAAGTTGGTA 


CAACTGTAAA 


AGATATTTCT 


AACAAGGCAG 


AAAGAGGAGA 


840 


ATTCCCTGGC 


GGTCAAGTGA 


TCGTTTACTC 


ATTGAAGGAT 


AAAGGGGTTG 


ACTTGGCAGT 


900 


AACAAACCTT 


TCAGAAGAAG 


GTAAAAAAGC 


TGTCGAAGAT 


GCAAAAGCTA 


AAATCCTTGA 


960 


TGGAAGCGTA 


AAAGTTCCTG 


AAAAA 








985 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 328 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:8: 

Gly Asn Arg Ser Ser Arg Asn Ala Ala Ser Ser Ser Asp Val Lys Thr 
15 10 15 

Lys Ala Ala lie Val Thr Asp Thr Gly Gly Val Asp Asp Lys Ser Phe 
20 25 30 

Asn Gin Ser Ala Trp Glu Gly Leu Gin Ala Trp Gly Lys Glu His Asn 
35 40 45 

Leu Ser Lys Asp Asn Gly Phe Thr Tyr Phe Gin Ser Thr Ser Glu Ala 
50 55 60 

Asp Tyr Ala Asn Asn Leu Gin Gin Ala Ala Gly Ser Tyr Asn Leu He 
65 70 75 80 

Phe Gly Val Gly Phe Ala Leu Asn Asn Ala Val Lys Asp Ala Ala Lys 
85 90 95 
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Glu His Thr Asp Leu Asn Tyr Val Leu lie Asp Asp Val lie Lys Asp 
100 105 110 

Gin Lys Asn Val Ala Ser Val Thr Phe Ala Asp Asn Glu Ser Gly Tyr 
115 120 125 

Leu Ala Gly Val Ala Ala Ala Lys Thr Thr Lys Thr Lys Gin Val Gly 
130 135 140 

Phe Val Gly Gly lie Glu Ser Glu Val lie Ser Arg Phe Glu Ala Gly 
145 150 155 160 

Phe Lys Ala Gly Val Ala Ser Val Asp Pro Ser lie Lys Val Gin Val 
165 170 175 

Asp Tyr Ala Gly Ser Phe Gly Asp Ala Ala Lys Gly Lys Thr lie Ala 
180 185 190 

Ala Ala Gin Tyr Ala Ala Gly Ala Asp lie Val Tyr Gin Val Ala Gly 
195 200 205 

Gly Thr Gly Ala Gly Val Phe Ala Glu Ala Lys Ser Leu Asn Glu Ser 
210 215 220 

Arg Pro Glu Asn Glu Lys Val Trp Val lie Gly Val Asp Arg Asp Gin 
225 230 235 240 

Glu Ala Glu Gly Lys Tyr Thr Ser Lys Asp Gly Lys Glu Ser Asn Phe 
245 250 255 

Val Leu Val Ser Thr Leu Lys Gin Val Gly Thr Thr Val Lys Asp lie 
260 265 270 

Ser Asn Lys Ala Glu Arg Gly Glu Phe Pro Gly Gly Gin Val lie Val 
275 280 285 

Tyr Ser Leu Lys Asp Lys Gly Val Asp Leu Ala Val Thr Asn Leu Ser 
290 295 300 

Glu Glu Gly Lys Lys Ala Val Glu Asp Ala Lys Ala Lys lie Leu Asp 
305 310 315 320 

Gly Ser Val Lys Val Pro Glu Lys 
325 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1404 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

TGTGGAAATT TGACAGGTAA CAGCAAAAAA GCTGCTGATT CAGGTGACAA ACCTGTTATC 60 

AAAATGTACC AAATCGGTGA CAAACCAGAC AACTTGGATG AATTGTTAGC AAATGCCAAC 120 

AAAATCATTG AAGAAAAAGT TGGTGCCAAA TTGGATATCC AATACCTTGG CTGGGGTGAC 180 

TATGGTAAGA AAATGTCAGT TATCACATCA TCTGGTGAAA ACTATGATAT TGCCTTTGCA 240 
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GATAACTATA 


TTGTAAATGC 


TCAAAAAGGT 


GCTTACGCTG 


ACTTGACAGA 


ATTGTACAAA 


300 


AAAGAAGGTA 


AAGACCTTTA 


CAAAGCACTT 


GACCCAGCTT 


ACATCAAGGG 


TAATACTGTA 


360 


AATGGTAAGA 


TTTACGCTGT 


TCCAGTTGCA 


GCCAACGTTG 


CATCATCTCA 


AAACTTTGCC 


420 


TTCAACGGAA 


CTCTCCTTGC 


TAAATATGGT 


ATCGATATTT 


CAGGTGTTAC 


TTCTTACGAA 


480 


ACTCTTGAGC 


CAGTCTTGAA 


ACAAATCAAA 


GAAAAAGCTC 


CAGACGTAGT 


ACCATTTGCT 


540 


ATTGGTAAAG 


TTTTCATCCC 


ATCTGATAAT 


TTTGACTACC 


CAGTAGCAAA 


CGGTCTTCCA 


600 


TTCGTTATCG 


ACCTTGAAGG 


CGATACTACT 


AAAGTTGTAA 


ACCGTTACGA 


AGTGCCTCGT 


660 


TTCAAAGAAC 


ACTTGAAGAC 


TCTTCACAAA 


TTCTATGAAG 


CTGGCTACAT 


TCCAAAAGAC 


720 


GTCGCAACAA 


GCGATACTTC 


CTTTGACCTT 


CAACAAGATA 


CTTGGTTCGT 


TCGTGAAGAA 


780 


ACAGTAGGAC 


CAGCTGACTA 


CGGTAACAGC 


TTGCTTTCAC 


GTGTTGCCAA 


CAAAGATATC 


840 


CAAATCAAAC 


CAATTACTAA 


CTTCATCAAG 


NAAAACCAAA 


CAACACAAGT 


TGCTAACTTT 


900 


GTCATCTCAA 


ACAACTCTAA 


GAACAAAGAA 


AAATCAATGG 


AAATCTTGAA 


CCTCTTGAAT 


960 


ACGAACCCAG 


AACTCTTGAA 


CGGTCTTGTT 


TACGGTCCAG 


AAGGCAAGAA 


CTGGGAAAAA 


1020 


ATTGAAGGTA 


AAGAAAACCG 


TGTTCGCGTT 


CTTGATGGCT 


ACAAAGGAAA 


CACTCACATG 


1080 


GGTGGATGGA 


ACACTGGTAA 


CAACTGGATC 


CTTTACATCA 


ACGAAAACGT 


TACAGACCAA 


1140 


CAAATCGAAA 


ATTCTAAGAA 


AGAATTGGCA 


GAAGCTAAAG 


AATCTCCAGC 


GCTTGGATTT 


1200 


ATCTTCAATA 


CTGACAATGT 


GAAATCTGAA 


ATCTCAGCTA 


TTGCTAACAC 


AATGCAACAA 


1260 


TTTGATACAG 


CTATCAACAC 


TGGTACTGTA 


GACCCAGATA 


AAGCGATTCC 


AGAATTGATG 


1320 


GAAAAATTGA 


AATCTGAAGG 


TGCCTACGAA 


AAAGTATTGA 


ACGAAATGCA 


AAAACAATAC 


1380 


GATGAATTCT 


TGAAAAACAA 


AAAA 








1404 


(2) INFORMATION FOR SEQ ID NO: 10: 











(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 468 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Cys Gly Asn Leu Thr Gly Asn Ser Lys Lys Ala Ala Asp Ser Gly Asp 
1 5 10 15 

Lys Pro Val lie Lys Met Tyr Gin lie Gly Asp Lys Pro Asp Asn Leu 
20 25 30 

Asp Glu Leu Leu Ala Asn Ala Asn Lys lie lie Glu Glu Lys Val Gly 
35 40 45 

Ala Lys Leu Asp lie Gin Tyr Leu Gly Trp Gly Asp Tyr Gly Lys Lys 
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50 55 60 

Met Ser Val lie Thr Ser Ser Gly Glu Asn Tyr Asp lie Ala Phe Ala 
65 70 75 80 

Asp Asn Tyr lie Val Asn Ala Gin Lys Gly Ala Tyr Ala Asp Leu Thr 
85 90 95 

Glu Leu Tyr Lys Lys Glu Gly Lys Asp Leu Tyr Lys Ala Leu Asp Pro 
100 105 110 

Ala Tyr lie Lys Gly Asn Thr Val Asn Gly Lys lie Tyr Ala Val Pro 
115 120 125 

Val Ala Ala Asn Val Ala Ser Ser Gin Asn Phe Ala Phe Asn Gly Thr 
130 135 140 

Leu Leu Ala Lys Tyr Gly lie Asp lie Ser Gly Val Thr Ser Tyr Glu 
145 150 155 160 

Thr Leu Glu Pro Val Leu Lys Gin lie Lys Glu Lys Ala Pro Asp Val 
165 170 175 

Val Pro Phe Ala lie Gly Lys Val Phe lie Pro Ser Asp Asn Phe Asp 
180 185 190 

Tyr Pro Val Ala Asn Gly Leu Pro Phe Val lie Asp Leu Glu Gly Asp 
195 200 205 

Thr Thr Lys Val Val Asn Arg Tyr Glu Val Pro Arg Phe Lys Glu His 
210 215 220 

Leu Lys Thr Leu His Lys Phe Tyr Glu Ala Gly Tyr lie Pro Lys Asp 
225 230 235 240 

Val Ala Thr Ser Asp Thr Ser Phe Asp Leu Gin Gin Asp Thr Trp Phe 
245 250 255 

Val Arg Glu Glu Thr Val Gly Pro Ala Asp Tyr Gly Asn Ser Leu Leu 
260 265 270 

Ser Arg Val Ala Asn Lys Asp lie Gin lie Lys Pro lie Thr Asn Phe 
275 280 285 

lie Lys Xaa Asn Gin Thr Thr Gin Val Ala Asn Phe Val lie Ser Asn 
290 295 300 

Asn Ser Lys Asn Lys Glu Lys Ser Met Glu lie Leu Asn Leu Leu Asn 
305 310 315 320 

Thr Asn Pro Glu Leu Leu Asn Gly Leu Val Tyr Gly Pro Glu Gly Lys 
325 330 335 

Asn Trp Glu Lys lie Glu Gly Lys Glu Asn Arg Val Arg Val Leu Asp 
340 345 350 

Gly Tyr Lys Gly Asn Thr His Met Gly Gly Trp Asn Thr Gly Asn Asn 
355 360 365 

Trp lie Leu Tyr lie Asn Glu Asn Val Thr Asp Gin Gin lie Glu Asn 
370 375 380 

Ser Lys Lys Glu Leu Ala Glu Ala Lys Glu Ser Pro Ala Leu Gly Phe 
385 390 395 400 
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He Phe Asn Thr Asp Asn Val Lys Ser Glu He Ser Ala He Ala Asn 
405 410 415 

Thr Met Gin Gin Phe Asp Thr Ala He Asn Thr Gly Thr Val Asp Pro 
420 425 430 

Asp Lys Ala He Pro Glu Leu Met Glu Lys Leu Lys Ser Glu Gly Ala 
435 440 445 

Tyr Glu Lys Val Leu Asn Glu Met Gin Lys Gin Tyr Asp Glu Phe Leu 
450 455 460 

Lys Asn Lys Lys 
465 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 937 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

TGGTCAAGGA ACTGCTTCTA AAGACAACAA AGAGGCAGAA CTTAAGAAGG TTGACTTTAT 60 

CCTAGACTGG ACACCAAATA CCAACCACAC AGGGCTTTAT GTTGCCAAGG AAAAAGGTTA 120 

TTTCAAAGAA GCTGGAGTGG ATGTTGATTT GAAATTGCCA CCAGAAGAAA GTTCTTCTGA 180 

CTTGGTTATC AACGGAAAGG CACCATTTGC AGTGTATTTC CAAGACTACA TGGCTAAGAA 240 

ATTGGAAAAA GGAGCAGGAA TCACTGCCGT TGCAGCTATT GTTGAACACA ATACATCAGG 300 

AATCATCTCT CGTAAATCTG ATAATGTAAG CAGTCCAAAA GACTTGGTTG GTAAGAAATA 360 

TGGGACATGG AATGACCCAA CTGAACTTGC TATGTTGAAA ACCTTGGTAG AATCTCAAGG 420 

TGGAGACTTT GAGAAGGTTG AAAAAGTACC AAATAACGAC TCAAACTCAA TCACACCGAT 480 

TGCCAATGGC GTCTTTGATA CTGCTTGGAT TTACTACGGT TGGGATGGTA TCCTTGCTAA 540 

ATCTCAAGGT GTAGATGCTA ACTTCATGTA CTTGAAAGAC TATGTCAAGG AGTTTGACTA 600 

CTATTCACCA GTTATCATCG CAAACAACGA CTATCTGAAA GATAACAAAG AAGAAGCTCG 660 

CAAAGTCATC CAAGCCATCA AAAAAGGCTA CCAATATGCC ATGGAACATC CAGAAGAAGC 720 

TGCAGATATT CTCATCAAGA ATGCACCTGA ACTCAAGGAA AAACGTGACT TTGTCATCGA 7 80 

ATCTCAAAAA TACTTGTCAA AAGAATACGC AAGCGACAAG GAAAAATGGG GTCAATTTGA 840 

CGCAGCTCGC TGGAATGCTT TCTACAAATG GGATAAAGAA AATGGTATCC TTAAAGAAGA 900 

CTTGACAGAC AAAGGCTTCA CCAACGAATT TGTGAAA 937 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 312 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Gly Gin Gly Thr Ala Ser Lys Asp Asn Lys Glu Ala Glu Leu Lys Lys 
15 10 15 

Val Asp Phe lie Leu Asp Trp Thr Pro Asn Thr Asn His Thr Gly Leu 
20 25 30 

Tyr Val Ala Lys Glu Lys Gly Tyr Phe Lys Glu Ala Gly Val Asp Val 
35 40 45 

Asp Leu Lys Leu Pro Pro Glu Glu Ser Ser Ser Asp Leu Val lie Asn 
50 55 60 

Gly Lys Ala Pro Phe Ala Val Tyr Phe Gin Asp Tyr Met Ala Lys Lys 
65 70 75 80 

Leu Glu Lys Gly Ala Gly lie Thr Ala Val Ala Ala lie Val Glu His 
85 90 95 

Asn Thr Ser Gly lie lie Ser Arg Lys Ser Asp Asn Val Ser Ser Pro 
100 105 110 

Lys Asp Leu Val Gly Lys Lys Tyr Gly Thr Trp Asn Asp Pro Thr Glu 
115 120 125 

Leu Ala Met Leu Lys Thr Leu Val Glu Ser Gin Gly Gly Asp Phe Glu 
130 135 140 

Lys Val Glu Lys Val Pro Asn Asn Asp Ser Asn Ser lie Thr Pro lie 
145 150 155 160 

Ala Asn Gly Val Phe Asp Thr Ala Trp lie Tyr Tyr Gly Trp Asp Gly 
165 170 175 

lie Leu Ala Lys Ser Gin Gly Val Asp Ala Asn Phe Met Tyr Leu Lys 
180 185 190 

Asp Tyr Val Lys Glu Phe Asp Tyr Tyr Ser Pro Val lie lie Ala Asn 
195 200 205 

Asn Asp Tyr Leu Lys Asp Asn Lys Glu Glu Ala Arg Lys Val lie Gin 
210 215 220 

Ala lie Lys Lys Gly Tyr Gin Tyr Ala Met Glu His Pro Glu Glu Ala 
225 230 235 240 

Ala Asp lie Leu lie Lys Asn Ala Pro Glu Leu Lys Glu Lys Arg Asp 
245 250 255 

Phe Val lie Glu Ser Gin Lys Tyr Leu Ser Lys Glu Tyr Ala Ser Asp 
260 265 270 

Lys Glu Lys Trp Gly Gin Phe Asp Ala Ala Arg Trp Asn Ala Phe Tyr 
275 280 285 

Lys Trp Asp Lys Glu Asn Gly lie Leu Lys Glu Asp Leu Thr Asp Lys 
290 295 300 



1 
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Gly Phe Thr Asn Glu Phe Val Lys 
305 310 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 799 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



TAGCTCAGGT 


GGAAACGCTG 


GTTCATCCTC 


TGGAAAAACA 


ACTGCCAAAG 


CTCGCACTAT 


60 


CGATGAAATC 


AAAAAAAGCG 


GTGAACTGCG 


AATCGCCGTG 


TTTGGAGATA 


AAAAACCGTT 


120 


TGGCTACGTT 


GACAATGATG 


GTTCTACCAA 


GGTACGCTAC 


GATATTGAAC 


TAGGGAACCA 


180 


ACTAGCTCAA 


GACCTTGGTG 


TCAAGGTTAA 


ATACATTTCA 


GTCGATGCTG 


CCAACCGTGC 


240 


GGAATACTTG 


ATTTCAAACA 


AGGTAGATAT 


TACTCTTGCT 


AACTTTACAG 


TAACTGACGA 


300 


ACGTAAGAAA 


CAAGTTGATT 


TTGCCCTTCC 


ATATATGAAA 


GTTTCTCTGG 


GTGTCGTATC 


360 


ACCTAAGACT 


GGTCTCATTA 


CAGACGTCAA 


ACAACTTGAA 


GGTAAAACCT 


TAATTGTCAC 


420 


AAAAGGAACG 


ACTGCTGAGA 


CTTATTTTGA 


AAAGAATCAT 


CCAGAAATCA 


AACTCCAAAA 


480 


ATACGACCAA 


TACAGTGACT 


CTTACCAAGC 


TCTTCTTGAC 


GGACGTGGAG 


ATGCCTTTTC 


540 


AACTGACAAT 


ACGGAAGTTC 


TAGCTTGGGC 


GCTTGAAAAT 


AAAGGATTTG 


AAGTAGGAAT 


600 


TACTTCCCTC 


GGTGATCCCG 


ATACCATTGC 


GGCAGCAGTT 


CAAAAAGGCA 


ACCAAGAATT 


660 


GCTAGACTTC 


ATCAATAAAG 


ATATTGAAAA 


ATTAGGCAAG 


GAAAACTTCT 


TCCACAAGGC 


720 


CTATGAAAAG 


ACACTTCACC 


CAACCTACGG 


TGACGCTGCT 


AAAGCAGATG 


ACCTGGTTGT 


780 


TGAAGGTGGA 


AAAGTTGAT 










799 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 266 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Ser Ser Gly Gly Asn Ala Gly Ser Ser Ser Gly Lys Thr Thr Ala Lys 
15 10 15 

Ala Arg Thr lie Asp Glu lie Lys Lys Ser Gly Glu Leu Arg lie Ala 
20 25 30 

Val Phe Gly Asp Lys Lys Pro Phe Gly Tyr Val Asp Asn Asp Gly Ser 
35 40 45 
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Thr Lys Val Arg Tyr Asp lie Glu Leu Gly Asn Gin Leu Ala Gin Asp 
50 55 60 

Leu Gly Val Lys Val Lys Tyr lie Ser Val Asp Ala Ala Asn Arg Ala 
65 70 75 80 

Glu Tyr Leu lie Ser Asn Lys Val Asp lie Thr Leu Ala Asn Phe Thr 
85 90 95 

Val Thr Asp Glu Arg Lys Lys Gin Val Asp Phe Ala Leu Pro Tyr Met 
100 105 110 

Lys Val Ser Leu Gly Val Val Ser Pro Lys Thr Gly Leu lie Thr Asp 
115 120 125 

Val Lys Gin Leu Glu Gly Lys Thr Leu lie Val Thr Lys Gly Thr Thr 
130 135 140 

Ala Glu Thr Tyr Phe Glu Lys Asn His Pro Glu lie Lys Leu Gin Lys 
145 150 155 160 

Tyr Asp Gin Tyr Ser Asp Ser Tyr- Gin Ala Leu Leu Asp Gly Arg Gly 
165 170 175 

Asp Ala Phe Ser Thr Asp Asn Thr Glu Val Leu Ala Trp Ala Leu Glu 
180 185 190 

Asn Lys Gly Phe Glu Val Gly lie Thr Ser Leu Gly Asp Pro Asp Thr 
195 200 205 

lie Ala Ala Ala Val Gin Lys Gly Asn Gin Glu Leu Leu Asp Phe lie 
210 215 220 

Asn Lys Asp lie Glu Lys Leu Gly Lys Glu Asn Phe Phe His Lys Ala 
225 230 235 240 

Tyr Glu Lys Thr Leu His Pro Thr Tyr Gly Asp Ala Ala Lys Ala Asp 
245 250 255 

Asp Leu Val Val Glu Gly Gly Lys Val Asp 
260 265 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1189 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

CTCCAACTAT GGTAAATCTG CGGATGGCAC AGTGACCATC GAGTATTTCA ACCAGAAAAA 60 

AGAAATGACC AAAACCTTGG AAGAAATCAC TCGTGATTTT GAGAAGGAAA ACCCTAAGAT 120 

CAAGGTCAAA GTCGTCAATG TACCAAATGC TGGTGAAGTA TTGAAGACAC GCGTTCTCGC 180 

AGGAGATGTG CCTGATGTGG TCAATATTTA CCCACAGTCC ATCGAACTGC AAGAATGGGC 240 

AAAAGCAGGT GTTTTTGAAG ATTTGAGCAA CAAAGACTAC CTGAAACGCG TGAAAAATGG 300 
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A A A Ti A T'/^/^T'r^ 

AAA i A i VjU i yj 


nPAAAPPAAAZV 
± AAr\\^tjira/-U-ii-i. 


AriTTTAPAAP 


f^TTPPTTTTA 

\jrX XV^V^X X X Xrt. 


PAnPTAATPP 


^ fin 

J O VJ 


rprp 7\ f~r\/^/~* 7\ A rprri 
i 1 A 1 tjtjAA i 1 


i AL- i AL, AAL-A 


A AP ATZi A ATT 


PPA APA APTP 


nr;pTTn A Ann 


TTPPTPAAAP 


420 






TAnTPa A AHA 


TATPnTTHPT 

XrlXV_oX X vjV^ X 


A A Ann AP AAA 


PAPPATTTnn 


480 




LiL- AVjA 1 oL. 1 1 


rr* AP APTP A A 


TnnTTAPA AT 


P A ATTAnPPT 


TTnpn AP APP 


540 


A A/^A^i^Ti^^OA 


/^i^ AAAA^^AAO 

GGAAAALrAAlj 


P A A A TP A A T A 
AAA 1 L-ri/i i t\ 


PPTTPPTTAT 
X^oX Xrt.X 


TPTPA APPA A 


ATPPP ATTA A 


o u u 


Ai italLCjtjAi 




A AP ATP AT AT 
AA^jA i 1 /ii 


PA APPTP ATP 


P AP ATPPTTP 


nP ATP A ATPn 


D D Vj 


ATL 1 AACjL AA 


A A O A A Ti/^ O O 

AAGAAL i LtGvj 


A APPTPPTPP 


PTATAPPP AT 


pTTATpnnAn 


ppTTpnp APn 


79 0 


TGGGGATGTC 


L 1 L.A i GAL, AU 


P A A A TPPPTP 

G AAA i GGo 1 


TTPPPPP ATP 


AP APPnATTA 


A TP A A P A A A A 


7ft n 


ALLGAAL Pi i 


AAGA 1 1 GGGA 


PPTiTP ATP AT 


TPP APP A A A A 


PAAAAAPPAP 


A A APPTT A AP 


O *± V 


LGl 1 GGXGLG 


GGAGAL 1 i GG 


P A Tip P Tip T> A T" 

G A i VjLi 1 L. 1 a J. 


PTP APPP APP 


APP A A AP ATP 


PA A A AP A APP 


-7 U V/ 


LAATGLL I I i 


GiGGAAJ-Al A 


•PP APPPPTPP 


AP A APTP ATP 


PA A A A AT APT 


APP A TP TPP A 


-7 D U 




A A p rr* A 

AL- ALjU o a ± L. 


A APPPPTP A A 


AP A AnpAnriA 


nA AnATTPAP 


pnPTTPPTnn 


1020 


TATGACCGAA 


TATGCCTTTA 


CGGATCGTCA 


CTTGGTCTGG 


TTGCAACAAT 


ACTGGACCAG 


1080 


TGAAGCAGAC 


TTCCATACCT 


TGACCATGAA 


CTATGTCTTG 


ACCGGTGATA 


AACAAGGCAT 


1140 


GGTCAATGAT 


TTGAATGCCT 


TCTTTAACCC 


GATGAAAGCG 


GATGTGGAT 




1189 


(2) INFORMATION FOR SEQ ID NO: 16: 











(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 396 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Ser Asn Tyr Gly Lys Ser Ala Asp Gly Thr Val Thr lie Glu Tyr Phe 
15 10 15 

Asn Gin Lys Lys Glu Met Thr Lys Thr Leu Glu Glu lie Thr Arg Asp 
20 25 30 

Phe Glu Lys Glu Asn Pro Lys lie Lys Val Lys Val Val Asn Val Pro 
35 40 45 

Asn Ala Gly Glu Val Leu Lys Thr Arg Val Leu Ala Gly Asp Val Pro 
50 55 60 

Asp Val Val Asn lie Tyr Pro Gin Ser lie Glu Leu Gin Glu Trp Ala 
65 70 75 80 

Lys Ala Gly Val Phe Glu Asp Leu Ser Asn Lys Asp Tyr Leu Lys Arg 
85 90 95 

Val Lys Asn Gly Tyr Ala Glu Lys Tyr Ala Val Asn Glu Lys Val Tyr 
100 105 110 



19 



Asn Val Pro Phe Thr Ala Asn Ala Tyr Gly lie Tyr Tyr Asn Lys Asp 
115 120 125 

Lys Phe Glu Glu Leu Gly Leu Lys Val Pro Glu Thr Trp Asp Glu Phe 
130 135 140 

Glu Gin Leu Val Lys Asp lie Val Ala Lys Gly Gin Thr Pro Phe Gly 
145 150 155 160 

lie Ala Gly Ala Asp Ala Trp Thr Leu Asn Gly Tyr Asn Gin Leu Ala 
165 170 175 

Phe Ala Thr Ala Thr Gly Gly Gly Lys Glu Ala Asn Gin Tyr Leu Arg 
180 185 190 

Tyr Ser Gin Pro Asn Ala lie Lys Leu Ser Asp Pro lie Met Lys Asp 
195 200 205 

Asp lie Lys Val Met Asp lie Leu Arg lie Asn Gly Ser Lys Gin Lys 
210 215 220 

Asn Trp Glu Gly Ala Gly Tyr Thr Asp Val lie Gly Ala Phe Ala Arg 
225 230 235 240 

Gly Asp Val Leu Met Thr Pro Asn Gly Ser Trp Ala lie Thr Ala lie 
245 250 255 

Asn Glu Gin Lys Pro Asn Phe Lys lie Gly Thr Phe Met lie Pro Gly 
260 265 270 

Lys Glu Lys Gly Gin Ser Leu Thr Val Gly Ala Gly Asp Leu Ala Trp 
275 280 285 

Ser lie Ser Ala Thr Thr Lys His Pro Lys Glu Ala Asn Ala Phe Val 
290 295 300 

Glu Tyr Met Thr Arg Pro Glu Val Met Gin Lys Tyr Tyr Asp Val Asp 
305 310 315 320 

Gly Ser Pro Thr Ala lie Glu Gly Val Lys Gin Ala Gly Glu Asp Ser 
325 330 335 

Pro Leu Ala Gly Met Thr Glu Tyr Ala Phe Thr Asp Arg His Leu Val 
340 345 350 

Trp Leu Gin Gin Tyr Trp Thr Ser Glu Ala Asp Phe His Thr Leu Thr 
355 360 365 

Met Asn Tyr Val Leu Thr Gly Asp Lys Gin Gly Met Val Asn Asp Leu 
370 375 380 

Asn Ala Phe Phe Asn Pro Met Lys Ala Asp Val Asp 
385 390 395 

INFORMATION FOR SEQ . ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 775 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
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TGGGAAAAAT TCTAGCGAAA CTAGTGGAGA TAATTGGTCA AAGTACCAGT CTAACAAGTC 60 

TATTACTATT GGATTTGATA GTACTTTTGT TCCAATGGGA TTTGCTCAGA AAGATGGTTC 120 

TTATGCAGGA TTTGATATTG ATTTAGCTAC AGCTGTTTTT GAAAAATACG GAATCACGGT 180 

AAATTGGCAA CCGATTGATT GGGATTTGAA AGAAGCTGAA TTGACAAAAG GAACGATTGA 240 

TCTGATTTGG AATGGCTATT CCGCTACAGA CGAACGCCGT GAAAAGGTGG CTTTCAGTAA 300 

CTCATATATG AAGAATGAGC AGGTATTGGT TACGAAGAAA TCATCTGGTA TCACGACTGC 360 

AAAGGATATG ACTGGAAAGA CATTAGGAGC TCAAGCTGGT TCATCTGGTT ATGCGGACTT 420 

TGAAGCAAAT CCAGAAATTT TGAAGAATAT TGTCGCTAAT AAGGAAGCGA ATCAATACCA 480 

AACCTTTAAT GAAGCCTTGA TTGATTTGAA AAACGATCGA ATTGATGGTC TATTGATTGA 540 

CCGTGTCTAT GCAAACTATT ATTTAGAAGC AGAAGGTGTT TTAAACGATT ATAATGTCTT 600 

TACAGTTGGA CTAGAAACAG AAGCTTTTGC GGTTGGAGCC CGTAAGGAAG ATACAAACTT 660 

GGTTAAGAAG ATAAATGAAG CTTTTTCTAG TCTTTACAAG GACGGCAAGT TCCAAGAAAT 720 

CAGCCAAAAA TGGTTTGGAG AAGATGTAGC AACCAAAGAA GTAAAAGAAG GACAG 775 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 258 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Gly Lys Asn Ser Ser Glu Thr Ser Gly Asp Asn Trp Ser Lys Tyr Gin 
15 10 15 

Ser Asn Lys Ser lie Thr lie Gly Phe Asp Ser Thr Phe Val Pro Met 
20 25 30 

Gly Phe Ala Gin Lys Asp Gly Ser Tyr Ala Gly Phe Asp lie Asp Leu 
35 40 45 

Ala Thr Ala Val Phe Glu Lys Tyr Gly lie Thr Val Asn Trp Gin Pro 
50 55 60 

lie Asp Trp Asp Leu Lys Glu Ala Glu Leu Thr Lys Gly Thr lie Asp 
65 70 75 80 

Leu lie Trp Asn Gly Tyr Ser Ala Thr Asp Glu Arg Arg Glu Lys Val 
85 90 95 

Ala Phe Ser Asn Ser Tyr Met Lys Asn Glu Gin Val Leu Val Thr Lys 
100 105 110 

Lys Ser Ser Gly lie Thr Thr Ala Lys Asp Met Thr Gly Lys Thr Leu 
115 120 125 
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Gly Ala Gin Ala Gly Ser Ser Gly Tyr Ala Asp Phe Glu Ala Asn Pro 
130 135 140 

Glu lie Leu Lys Asn lie Val Ala Asn Lys Glu Ala Asn Gin Tyr Gin 
145 150 155 160 

Thr Phe Asn Glu Ala Leu lie Asp Leu Lys Asn Asp Arg lie Asp Gly 
165 170 175 

Leu Leu lie Asp Arg Val Tyr Ala Asn Tyr Tyr Leu Glu Ala Glu Gly 
180 185 190 

Val Leu Asn Asp Tyr Asn Val Phe Thr Val Gly Leu Glu Thr Glu Ala 
195 200 205 

Phe Ala Val Gly Ala Arg Lys Glu Asp Thr Asn Leu Val Lys Lys lie 
210 215 220 

Asn Glu Ala Phe Ser Ser Leu Tyr Lys Asp Gly Lys Phe Gin Glu lie 
225 230 235 240 

Ser Gin Lys Trp Phe Gly Glu Asp Val Ala Thr Lys Glu Val Lys Glu 
245 250 255 



Gly Gin 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 868 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 



TGCTAGCGGA 


AAAAAAGATA 


CAACTTCTGG 


TCAAAAACTA 


AAAGTTGTTG 


CTACAAACTC 


60 


AATCATCGCT 


GATATTACTA 


AAAATATTGC 


TGGTGACAAA 


ATTGACCTTC 


ATAGTATCGT 


120 


TCCGATTGGG 


CAAGACCCAC 


ACGAATACGA 


ACCACTTCCT 


GAAGACGTTA 


AGAAAACTTC 


180 


TGAGGCTAAT 


TTGATTTTCT 


ATAACGGTAT 


CAACCTTGAA 


ACAGGTGGCA 


ATGCTTGGTT 


240 


TACAAAATTG 


GTAGAAAATG 


CCAAGAAAAC 


TGAAAACAAA 


GACTACTTCG 


CAGTCAGCGA 


300 


CGGCGTTGAT 


GTTATCTACC 


TTGAAGGTCA 


AAATGAAAAA 


GGAAAAGAAG 


ACCCACACGC 


360 


TTGGCTTAAC 


CTTGAAAACG 


GTATTATTTT 


TGCTAAAAAT 


ATCGCCAAAC 


AATTGAGCGC 


420 


CAAAGACCCT 


AACAATAAAG 


AATTCTATGA 


AAAAAATCTC 


AAAGAATATA 


CTGATAAGTT 


480 


AGACAAACTT 


GATAAAGAAA 


GTAAGGATAA 


ATTTAATAAG 


ATCCCTGCTG 


AAAAGAAACT 


540 


CATTGTAACC 


AGCGAAGGAG 


CATTCAAATA 


CTTCTCTAAA 


GCCTATGGTG 


TCCCAAGTGC 


600 


TTACATCTGG 


GAAATCAATA 


CTGAAGAAGA 


AGGAACTCCT 


GAACAAATCA 


AGACCTTGGT 


660 


TGAAAAACTT 


CGCCAAACAA 


AAGTTCCATC 


ACTCTTTGTA 


GAATCAAGTG 


TGGATGACCG 


720 


TCCAATGAAA 


ACTGTTTCTC 


AAGACACAAA 


CATCCCAATC 


TACGCTCAAA 


TCTTTACTGA 


780 
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CTCTATCGCA GAACAAGGTA AAGAAGGCGA CAGCTACTAC AGCATGATGA AATACAACCT 840 
TGACAAGATT GCTGAAGGAT TGGCAAAA 868 
(2) INFORMATION FOR SEQ ID NO: 20: . 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 289 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Ala Ser Gly Lys Lys Asp Thr Thr Ser Gly Gin Lys Leu Lys Val Val 
15 10 15 

Ala Thr Asn Ser lie lie Ala Asp lie Thr Lys Asn lie Ala Gly Asp 
20 25 30 

Lys lie Asp Leu His Ser lie Val Pro lie Gly Gin Asp Pro His Glu 
35 40 45 

Tyr Glu Pro Leu Pro Glu Asp Val Lys Lys Thr Ser Glu Ala Asn Leu 
50 55 60 

lie Phe Tyr Asn Gly lie Asn Leu Glu Thr Gly Gly Asn Ala Trp Phe 
65 70 75 80 

Thr Lys Leu Val Glu Asn Ala Lys Lys Thr Glu Asn Lys Asp Tyr Phe 
85 90 95 

Ala Val Ser Asp Gly Val Asp Val lie Tyr Leu Glu Gly Gin Asn Glu 
100 105 110 

Lys Gly Lys Glu Asp Pro His Ala Trp Leu Asn Leu Glu Asn Gly lie 
115 120 125 

lie Phe Ala Lys Asn lie Ala Lys Gin Leu Ser Ala Lys Asp Pro Asn 
130 135 140 

Asn Lys Glu Phe Tyr Glu Lys Asn Leu Lys Glu Tyr Thr Asp Lys Leu 
145 150 155 160 

Asp Lys Leu Asp Lys Glu Ser Lys Asp Lys Phe Asn Lys lie Pro Ala 
165 170 175 

Glu Lys Lys Leu lie Val Thr Ser Glu Gly Ala Phe Lys Tyr Phe Ser 
180 185 190 

Lys Ala Tyr Gly Val Pro Ser Ala Tyr lie Trp Glu lie Asn Thr Glu 
195 200 205 

Glu Glu Gly Thr Pro Glu Gin lie Lys Thr Leu Val Glu Lys Leu Arg 
210 215 220 

Gin Thr Lys Val Pro Ser Leu Phe Val Glu Ser Ser Val Asp Asp Arg 
225 230 235 240 

Pro Met Lys Thr Val Ser Gin Asp Thr Asn lie Pro lie Tyr Ala Gin 
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245 250 255 

lie Phe Thr Asp Ser lie Ala Glu Gin Gly Lys Glu Gly Asp Ser Tyr 
260 265 270 

Tyr Ser Met Met Lys Tyr Asn Leu Asp Lys lie Ala Glu Gly Leu Ala 
275 280 285 

Lys 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1546 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 



TGGCTCAAAA 


AATACAGCTT 


CAAGTCCAGA 


TTATAAGTTG 


GAAGGTGTAA 


CATTCCCGCT 


60 


TCAAGAAAAG 


AAAACATTGA 


AGTTTATGAC 


AGCCAGTTCA 


CCGTTATCTC 


CTAAAGACCC 


120 


AAATGAAAAG 


TTAATTTTGC 


AACGTTTGGA 


GAAGGAAACT 


GGCGTTCATA 


TTGACTGGAC 


180 


CAACTACCAA 


TCCGACTTTG 


CAGAAAAACG 


TAACTTGGAT 


ATTTCTAGTG 


GTGATTTACC 


240 


AGATGCTATC 


CACAACGACG 


GAGCTTCAGA 


TGTGGACTTG 


ATGAACTGGG 


CTAAAAAAGG 


300 


TGTTATTATT 


CCAGTTGAAG 


ATTTGATTGA 


TAAATACATG 


CCAAATCTTA 


AGAAAATTTT 


360 


GGATGAGAAA 


CCAGAGTACA 


AGGCCTTGAT 


GACAGCACCT 


GATGGGCACA 


TTTACTCATT 


420 


TCCATGGATT 


GAAGAGCTTG 


GAGATGGTAA 


AGAGTCTATT 


CACAGTGTCA 


ACGATATGGC 


480 


TTGGATTAAC 


AAAGATTGGC 


TTAAGAAACT 


TGGTCTTGAA 


ATGCCAAAAA 


CTACTGATGA 


540 


TTTGATTAAA 


GTCCTAGAAG 


CTTTCAAAAA 


CGGGGATCCA 


AATGGAAATG 


GAGAGGCTGA 


600 


TGAAATTCCA 


TTTTCATTTA 


TTAGTGGTAA 


CGGAAACGAA 


GATTTTAAAT 


TCCTATTTGC 


660 


TGCATTTGGT 


ATAGGGGATA 


ACGATGATCA 


TTTAGTAGTA 


GGAAATGATG 


GCAAAGTTGA 


720 


CTTCACAGCA 


GATAACGATA 


ACTATAAAGA 


AGGTGTCAAA 


TTTATCCGTC 


AATTGCAAGA 


780 


AAAAGGCCTG 


ATTGATAAAG 


AAGCTTTCGA 


ACATGATTGG 


AATAGTTACA 


TTGCTAAAGG 


840 


TCATGATCAG 


AAATTTGGTG 


TTTACTTTAC 


ATGGGATAAG 


AATAATGTTA 


CTGGAAGTAA 


900 


CGAAAGTTAT 


GATGTTTTAC 


CAGTACTTGC 


TGGACCAAGT 


GGTCAAAAAC 


ACGTAGCTCG 


960 


TACAAACGGT 


ATGGGATTTG 


CACGTGACAA 


GATGGTTATT 


ACCAGTGTAA 


ACAAAAACCT 


1020 


AGAATTGACA 


GCTAAATGGA 


TTGATGCACA 


ATACGCTCCA 


CTCCAATCTG 


TGCAAAATAA 


1080 


CTGGGGAACT 


TACGGAGATG 


ACAAACAACA 


AAACATCTTT 


GAATTGGATC 


AAGCGTCAAA 


1140 


TAGTCTAAAA 


CACTTACCAC 


TAAACGGAAC 


TGCACCAGCA 


GAACTTCGTC 


AAAAGACTGA 


1200 


AGTAGGAGGA 


CCACTAGCTA 


TCCTAGATTC 


ATACTATGGT 


AAAGTAACAA 


CCATGCCTGA 


1260 
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TGATGCCAAA TGGCGTTTGG ATCTTATCAA AGAATATTAT GTTCCTTACA TGAGCAATGT 1320 

CAATAACTAT CCAAGAGTCT TTATGACACA GGAAGATTTG GACAAGATTG CCCATATCGA 1380 

AGCAGATATG AATGACTATA TCTACCGTAA ACGTGCTGAA TGGATTGTAA ATGGCAATAT 1440 

TGATACTGAG TGGGATGATT ACAAGAAAGA ACTTGAAAAA TACGGACTTT CTGATTACCT 1500 

CGCTATTAAA CAAAAATACT ACGACCAATA CCAAGCAAAC AAAAAC 1546 
(2) INFORMATION FOR SEQ . ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 515 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

Gly Ser Lys Asn Thr Ala Ser Ser Pro Asp Tyr Lys Leu Glu Gly Val 
15 10 15 

Thr Phe Pro Leu Gin Glu Lys Lys Thr Leu Lys Phe Met Thr Ala Ser 
20 25 30 

Ser Pro Leu Ser Pro Lys Asp Pro Asn Glu Lys Leu lie Leu Gin Arg 
35 40 45 

Leu Glu Lys Glu Thr Gly Val His lie Asp Trp Thr Asn Tyr Gin Ser 
50 55 60 

Asp Phe Ala Glu Lys Arg Asn Leu Asp lie Ser Ser Gly Asp Leu Pro 
65 70 75 80 

Asp Ala lie His Asn Asp Gly Ala Ser Asp Val Asp Leu Met Asn Trp 
85 90 95 

Ala Lys Lys Gly Val lie lie Pro Val Glu Asp Leu lie Asp Lys Tyr 
100 105 110 

Met Pro Asn Leu Lys Lys lie Leu Asp Glu Lys Pro Glu Tyr Lys Ala 
115 120 125 

Leu Met Thr Ala Pro Asp Gly His lie Tyr Ser Phe Pro Trp lie Glu 
130 135 140 

Glu Leu Gly Asp Gly Lys Glu Ser lie His Ser Val Asn Asp Met Ala 
145 150 155 160 

Trp lie Asn Lys Asp Trp Leu Lys Lys Leu Gly Leu Glu Met Pro Lys 
165 170 175 

Thr Thr Asp Asp Leu lie Lys Val Leu Glu Ala Phe Lys Asn Gly Asp 
180 185 190 

Pro Asn Gly Asn Gly Glu Ala Asp Glu lie Pro Phe Ser Phe lie Ser 
195 200 205 

Gly Asn Gly Asn Glu Asp Phe Lys Phe Leu Phe Ala Ala Phe Gly lie 
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210 215 220 

Gly Asp Asn Asp Asp His Leu Val Val Gly Asn Asp Gly Lys Val Asp 
225 230 235 240 

Phe Thr Ala Asp Asn Asp Asn Tyr Lys Glu Gly Val Lys Phe lie Arg 
245 250 255 

Gin Leu Gin Glu Lys Gly Leu lie Asp Lys Glu Ala Phe Glu His Asp 
260 265 270 

Trp Asn Ser Tyr lie Ala Lys Gly His Asp Gin Lys Phe Gly Val Tyr 
275 280 285 

Phe Thr Trp Asp Lys Asn Asn Val Thr Gly Ser Asn Glu Ser Tyr Asp 
290 295 300 

Val Leu Pro Val Leu Ala Gly Pro Ser Gly Gin Lys His Val Ala Arg 
305 310 315 320 

Thr Asn Gly Met Gly Phe Ala Arg Asp Lys Met Val lie Thr Ser Val 
325 330 335 

Asn Lys Asn Leu Glu Leu Thr Ala Lys Trp lie Asp Ala Gin Tyr Ala 
340 345 350 

Pro Leu Gin Ser Val Gin Asn Asn Trp Gly Thr Tyr Gly Asp Asp Lys 
355 360 365 

Gin Gin Asn lie Phe Glu Leu Asp Gin Ala Ser Asn Ser Leu Lys His 
370 375 380 

Leu Pro Leu Asn Gly Thr Ala Pro Ala Glu Leu Arg Gin Lys Thr Glu 
385 390 395 400 

Val Gly Gly Pro Leu Ala lie Leu Asp Ser Tyr Tyr Gly Lys Val Thr 
405 410 415 

Thr Met Pro Asp Asp Ala Lys Trp Arg Leu Asp Leu lie Lys Glu Tyr 
420 425 430 

Tyr Val Pro Tyr Met Ser Asn Val Asn Asn Tyr Pro Arg Val Phe Met 
435 440 445 

Thr Gin Glu Asp Leu Asp Lys lie Ala His lie Glu Ala Asp Met Asn 
450 455 460 

Asp Tyr lie Tyr Arg Lys Arg Ala Glu Trp lie Val Asn Gly Asn lie 
465 470 475 480 

Asp Thr Glu Trp Asp Asp Tyr Lys Lys Glu Leu Glu Lys Tyr Gly Leu 
485 490 495 

Ser Asp Tyr Leu Ala lie Lys Gin Lys Tyr Tyr Asp Gin Tyr Gin Ala 
500 505 510 

Asn Lys Asn 
515 

INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 895 base pairs 

(B) TYPE: nucleic acid 
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( C ) STRANDEDNES S : doubl e 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

TAGTACAAAC TCAAGCACTA GTCAGACAGA GACCAGTAGC TCTGCTCCAA CAGAGGTAAC 60 

CATTAAAAGT TCACTGGACG AGGTCAAACT TTCCAAAGTT CCTGAAAAGA TTGTGACCTT 120 

TGACCTCGGC GCTGCGGATA CTATTCGCGC TTTAGGATTT GAAAAAAATA TCGTCGGAAT 180 

GCCTACAAAA ACTGTTCCGA CTTATCTAAA AGACCTAGTG GGAACTGTCA AAAATGTTGG 240 

TTCTATGAAA GAACCTGATT TAGAAGCTAT CGCCGCCCTT GAGCCTGATT TGATTATCGC 300 

TTCGCCACGT ACACAAAAAT TCGTAGACAA ATTCAAAGAA ATCGCCCCAA CCGTTCTCTT 360 

CCAAGCAAGC AAGGACGACT ACTGGACTTC TACCAAGGCT AATATCGAAT CCTTAGCAAG 420 

TGCCTTCGGC GAAACTGGTA CACAGAAAGC CAAGGAAGAA TTGACCAAGC TAGACAAGAG 480 

CATCCAAGAA GTCGCTACTA AAAATGAAAG CTCTGACAAA AAAGCCCTTG CGATCCTCCT 540 

TAATGAAGGA AAAATGGCAG CCTTTGGTGC CAAATCTCGT TTCTCTTTCT TGTACCAAAC 600 

CTTGAAATTC AAACCAACTG ATACAAAATT TGAAGACTCA CGCCACGGAC AAGAAGTCAG 660 

CTTTGAAAGT GTCAAAGAAA TCAACCCTGA CATCCTCTTT GTCATCAACC GTACCCTTGC 720 

CATCGGTGGG GACAACTCTA GCAACGACGG TGTCCTAGAA AATGCCCTTA TCGCTGAAAC 780 

ACCTGCTGCT AAAAATGGTA AGATTATCCA ACTAACACCA GACCTCTGGT ATCTAAGCGG 840 

AGGCGGACTT GAATCAACAA AACTCATGAT TGAAGACATA CAAAAAGGTT TGAAA 895 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 298 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 

Ser Thr Asn Ser Ser Thr Ser Gin Thr Glu Thr Ser Ser Ser Ala Pro 
15 10 15 

Thr Glu Val Thr lie Lys Ser Ser Leu Asp Glu Val Lys Leu Ser Lys 
20 25 30 

Val Pro Glu Lys lie Val Thr Phe Asp Leu Gly Ala Ala Asp Thr lie 
35 40 45 

Arg Ala Leu Gly Phe Glu Lys Asn lie Val Gly Met Pro Thr Lys Thr 
50 55 60 

Val Pro Thr Tyr Leu Lys Asp Leu Val Gly Thr Val Lys Asn Val Gly 
65 70 75 80 
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Ser Met Lys Glu Pro Asp Leu Glu Ala lie Ala Ala Leu Glu Pro Asp 
85 90 95 

Leu lie lie Ala Ser Pro Arg Thr Gin Lys Phe Val Asp Lys Phe Lys 
100 105 110 

Glu lie Ala Pro Thr Val Leu Phe Gin Ala Ser Lys Asp Asp Tyr Trp 
115 120 125 

Thr Ser Thr Lys Ala Asn lie Glu Ser Leu Ala Ser Ala Phe Gly Glu 
130 135 140 

Thr Gly Thr Gin Lys Ala Lys Glu Glu Leu Thr Lys Leu Asp Lys Ser 
145 150 155 160 

lie Gin Glu Val Ala Thr Lys Asn Glu Ser Ser Asp Lys Lys Ala Leu 
165 170 175 

Ala lie Leu Leu Asn Glu Gly Lys Met Ala Ala Phe Gly Ala Lys Ser 
180 185 190 

Arg Phe Ser Phe Leu Tyr Gin Thr Leu Lys Phe Lys Pro Thr Asp Thr 
195 200 205 

Lys Phe Glu Asp Ser Arg His Gly Gin Glu Val Ser Phe Glu Ser Val 
210 215 220 

Lys Glu lie Asn Pro Asp lie Leu Phe Val He Asn Arg Thr Leu Ala 
225 230 235 240 

He Gly Gly Asp Asn Ser Ser Asn Asp Gly Val Leu Glu Asn Ala Leu 
245 250 255 

He Ala Glu Thr Pro Ala Ala Lys Asn Gly Lys He He Gin Leu Thr 
260 265 270 

Pro Asp Leu Trp Tyr Leu Ser Gly Gly Gly Leu Glu Ser Thr Lys Leu 
275 280 285 

Met He Glu Asp He Gin Lys Ala Leu Lys 
290 295 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1261 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 



TGGCAATTCT 


GGCGGAAGTA 


AAGATGCTGC 


CAAATCAGGT 


GGTGACGGTG 


CCAAAACAGA 


60 


AATCACTTGG 


TGGGCATTCC 


CAGTATTTAC 


CCAAGAAAAA 


ACTGGTGACG 


GTGTTGGAAC 


120 


TTATGAAAAA 


TCAATCATCG 


AAGCGTTTGA 


AAAAGCAAAC 


CCAGATATAA 


AAGTGAAATT 


180 


GGAAACCATC 


GACTTCAAGT 


CAGGTCCTGA 


AAAAATCACA 


ACAGCCATCG 


AAGCAGGAAC 


240 


AGCTCCAGAC 


GTACTCTTTG 


ATGCACCAGG 


ACGTATCATC 


CAATACGGTA 


AAAACGGTAA 


300 
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ATTGGCTGAG 


TTGAATGACC 


TCTTCACAGA 


TGAATTTGTT 


AAAGATGTCA 


ACAATGAAAA 


360 


CATCGTACAA 


GCAAGTAAAG 


CTGGAGACAA 


GGCTTATATG 


TATCCGATTA 


GTTCTGCCCC 


420 


ATTCTACATG 


GCAATGAACA 


AGAAAATGTT 


AGAAGATGCT 


GGAGTAGCAA 


ACCTTGTAAA 


480 


AGAAGGTTGG 


ACAACTGATG 


ATTTTGAAAA 


AGTATTGAAA 


GCACTTAAAG 


ACAAGGGTTA 


540 


CACACCAGGT 


TCATTGTTCA 


GTTCTGGTCA 


AGGGGGAGAC 


CAAGGAACAC 


GTGCCTTTAT 


600 


CTCTAACCTT 


TATAGCGGTT 


CTGTAACAGA 


TGAAAAAGTT 


AGCAAATATA 


CAACTGATGA 


660 


TCCTAAATTC 


GTCAAAGGTC 


TTGAAAAAGC 


AACTAGCTGG 


ATTAAAGACA 


ATTTGATCAA 


720 


TAATGGTTCA 


CAATTTGACG 


GTGGGGCAGA 


TATCCAAAAC 


TTTGCCAACG 


GTCAAACATC 


780 


TTACACAATC 


CTTTGGGCAC 


CAGCTCAAAA 


TGGTATCCAA 


GCTAAACTTT 


TAGAAGCAAG 


840 


TAAGGTAGAA 


GTGGTAGAAG 


TACCATTCCC 


ATCAGACGAA 


GGTAAGCCAG 


CTCTTGAGTA 


900 


CCTTGTAAAC 


GGGTTTGCAG 


TATTCAACAA 


TAAAGACGAC 


AAGAAAGTCG 


CTGCATCTAA 


960 


GAAATTCATC 


CAGTTTATCG 


CAGATGACAA 


GGAGTGGGGA 


CCTAAAGACG 


TAGTTCGTAC 


1020 


AGGTGCTTTC 


CCAGTCCGTA 


CTTCATTTGG 


AAAACTTTAT 


GAAGACAAAC 


GCATGGAAAC 


1080 


AATCAGCGGC 


TGGACTCAAT 


ACTACTCACC 


ATACTACAAC 


ACTATTGATG 


GATTTGCTGA 


1140 


AATGAGAACA 


CTTTGGTTCC 


CAATGTTGCA 


ATCTGTATCA 


AATGGTGACG 


AAAAACCAGC 


1200 


AGATGCTTTG 


AAAGCCTTCA 


CTGAAAAAGC 


GAACGAAACA 


ATCAAAAAAG 


CTATGAAACA 


1260 



A 1261 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 420 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Gly Asn Ser Gly Gly Ser Lys Asp Ala Ala Lys Ser Gly Gly Asp Gly 
15 10 15 

Ala Lys Thr Glu lie Thr Trp Trp Ala Phe Pro Val Phe Thr Gin Glu 
20 25 30 

Lys Thr Gly Asp Gly Val Gly Thr Tyr Glu Lys Ser lie lie Glu Ala 
35 40 45 

Phe Glu Lys Ala Asn Pro Asp lie Lys Val Lys Leu Glu Thr lie Asp 
50 55 60 

Phe Lys Ser Gly Pro Glu Lys lie Thr Thr Ala lie Glu Ala Gly Thr 
65 70 75 80 

Ala Pro Asp Val Leu Phe Asp Ala Pro Gly Arg lie lie Gin Tyr Gly 
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85 90 95 

Lys Asn Gly Lys Leu Ala Glu Leu Asn Asp Leu Phe Thr Asp Glu Phe 
100 105 110 

Val Lys Asp Val Asn Asn Glu Asn lie Val Gin Ala Ser Lys Ala Gly 
115 120 125 

Asp Lys Ala Tyr Met Tyr Pro lie Ser Ser Ala Pro Phe Tyr Met Ala 
130 135 140 

Met Asn Lys Lys Met Leu Glu Asp Ala Gly Val Ala Asn Leu Val Lys 
145 150 155 160 

Glu Gly Trp Thr Thr Asp Asp Phe Glu Lys Val Leu Lys Ala Leu Lys 
165 170 175 

Asp Lys Gly Tyr Thr Pro Gly Ser Leu Phe Ser Ser Gly Gin Gly Gly 
180 185 190 

Asp Gin Gly Thr Arg Ala Phe He Ser Asn Leu Tyr Ser Gly Ser Val 
195 200 205 

Thr Asp Glu Lys Val Ser Lys Tyr Thr Thr Asp Asp Pro Lys Phe Val 
210 215 220 

Lys Gly Leu Glu Lys Ala Thr Ser Trp He Lys Asp Asn Leu He Asn 
225 230 235 240 

Asn Gly Ser Gin Phe Asp Gly Gly Ala Asp He Gin Asn Phe Ala Asn 
245 250 255 

Gly Gin Thr Ser Tyr Thr He Leu Trp Ala Pro Ala Gin Asn Gly He 
260 265 270 

Gin Ala Lys Leu Leu Glu Ala Ser Lys Val Glu Val Val Glu Val Pro 
275 280 285 

Phe Pro Ser Asp Glu Gly Lys Pro Ala Leu Glu Tyr Leu Val Asn Gly 
290 295 300 

Phe Ala Val Phe Asn Asn Lys Asp Asp Lys Lys Val Ala Ala Ser Lys 
305 310 315 320 

Lys Phe He Gin Phe He Ala Asp Asp Lys Glu Trp Gly Pro Lys Asp 
325 330 335 

Val Val Arg Thr Gly Ala Phe Pro Val Arg Thr Ser Phe Gly Lys Leu 
340 345 350 

Tyr Glu Asp Lys Arg Met Glu Thr He Ser Gly Trp Thr Gin Tyr Tyr 
355 360 365 

Ser Pro Tyr Tyr Asn Thr He Asp Gly Phe Ala Glu Met Arg Thr Leu 
370 375 380 

Trp Phe Pro Met Leu Gin Ser Val Ser Asn Gly Asp Glu Lys Pro Ala 
385 390 395 400 

Asp Ala Leu Lys Ala Phe Thr Glu Lys Ala Asn Glu Thr He Lys Lys 
405 410 415 



Ala Met 



Lys Gin 
420 
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(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 658 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

TTCACAAGAA AAAACAAAAA ATGAAGATGG AGAAACTAAG ACAGAACAGA CAGCCAAAGC 60 

TGATGGAACA GTCGGTAGTA AGTCTCAAGG AGCTGCCCAG AAGAAAGCAG AAGTGGTCAA 120 

TAAAGGTGAT TACTACAGCA TTCAAGGGAA ATACGATGAA ATCATCGTAG CCAACAAACA 180 

CTATCCATTG TCTAAAGACT ATAATCCAGG GGAAAATCCA ACAGCCAAGG CAGAGTTGGT 240 

CAAACTCATC AAAGCGATGC AAGAGGCAGG TTTCCCTATT AGTGATCATT ACAGTGGTTT 300 

TAGAAGTTAT GAAACTCAGA CCAAGCTCTA TCAAGATTAT GTCAACCAAG ATGGAAAGGC 360 

AGCAGCTGAC CGTTACTCTG CCCGTCCTGG CTATAGCGAA CACCAGACAG GCTTGGCCTT 420 

TGATGTGATT GGGACTGATG GTGATTTGGT GACAGAAGAA AAAGCAGCCC AATGGCTCTT 480 

GGATCATGCA GCTGATTATG GCTTTGTTGT CCGTTATCTC AAAGGCAAGG AAAAGGAAAC 540 

AGGCTATATG GCTGAAGAAT GGCACCTGCG TTATGTAGGA AAAGAAGCTA AAGAAATTGC 600 

TGCAAGTGGT CTCAGTTTGG AAGAATACTA TGGCTTTGAA GGCGGAGACT ACGTCGAT 658 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 219 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Ser Gin Glu Lys Thr Lys Asn Glu Asp Gly Glu Thr Lys Thr Glu Gin 
15 10 15 

Thr Ala Lys Ala Asp Gly Thr Val Gly Ser Lys Ser Gin Gly Ala Ala 
20 25 30 

Gin Lys Lys Ala Glu Val Val Asn Lys Gly Asp Tyr Tyr Ser lie Gin 
35 40 45 

Gly Lys Tyr Asp Glu lie lie Val Ala Asn Lys His Tyr Pro Leu Ser 
50 55 60 

Lys Asp Tyr Asn Pro Gly Glu Asn Pro Thr Ala Lys Ala Glu Leu Val 
65 70 75 80 

Lys Leu lie Lys Ala Met Gin Glu Ala Gly Phe Pro lie Ser Asp His 
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85 90 95 

Tyr Ser Gly Phe Arg Ser Tyr Glu Thr Gin Thr Lys Leu Tyr Gin Asp 
100 105 110 

Tyr Val Asn Gin Asp Gly Lys Ala Ala Ala Asp Arg Tyr Ser Ala Arg 
115 120 125 

Pro Gly Tyr Ser Glu His Gin Thr Gly Leu Ala Phe Asp Val lie Gly 
130 135 140 

Thr Asp Gly Asp Leu Val Thr Glu Glu Lys Ala Ala Gin Trp Leu Leu 
145 150 155 160 

Asp His Ala Ala Asp Tyr Gly Phe Val Val Arg Tyr Leu Lys Gly Lys 
165 170 175 

Glu Lys Glu Thr Gly Tyr Met Ala Glu Glu Trp His Leu Arg Tyr Val 
180 IBS 190 

Gly Lys Glu Ala Lys Glu lie Ala Ala Ser Gly Leu Ser Leu Glu Glu 
195 200 205 

Tyr Tyr Gly Phe Glu Gly Gly Asp Tyr Val Asp 
210 215 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 790 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 



GAAAGGTCTG 


TGGTCAAATA 


ATCTTACCTG 


CGGTTATGAT 


GAAAAAATAA 


TCTTGGAAAA 


60 


TATAAATATA 


AAAATACCTG 


AAGAAAAAAT 


ATCAGTTATT 


ATTGGGTCAA 


ATGGTTGTGG 


120 


GAAATCAACA 


CTCATTAAAA 


CCTTGTCTCG 


ACTTATAAAG 


CCATTAGAGG 


GAGAAGTATT 


180 


GCTTGATAAT 


AAATCAATTA 


ATTCTTATAA 


AGAAAAAGAT 


TTAGCAAAAC 


ACATAGCTAT 


240 


ATTACCTCAA 


TCTCCAATAA 


TCCCTGAATC 


AATAACAGTA 


GCTGATCTTG 


TAAGCCGTGG 


300 


TCGTTTCCCC 


TACAGAAAGC 


CTTTTAAGAG 


TCTTGGAAAA 


GATGACCTTG 


AAATAATAAA 


360 


CAGATCAATG 


GTTAAGGCCA 


ATGTTGAAGA 


TCTAGCAAAT 


AACCTAGTTG 


AAGAACTTTC 


420 


TGGGGGTCAA 


AGGCAAAGAG 


TATGGATAGC 


TCTAGCCCTA 


GCCCAAGATA 


CAAGTATCCT 


480 


ACTTTTAGAT 


GAGCCAACTA 


CTTACTTGGA 


TATCTCATAT 


CAAATAGAAC 


TATTAGACCT 


540 


CTTGACTGAT 


CTAAACCAAA 


AATATAAGAC 


AACCATTTGC 


ATGATTTTGC 


ACGATATAAA 


600 


TCTAACAGCA 


AGATACGCTG 


ATTACCTATT 


TGCAATTAAA 


GAAGGTAAAC 


TTGTTGCAGA 


660 


GGGAAAGCCT 


GAAGATATAC 


TAAATGATAA 


ACTAGTTAAA 


GATATCTTTA 


ATCTTGAAGC 


720 


AAAAATTATA 


CGTGACCCTA 


TTTCCAATTC 


GCCTCTAATG 


ATTCCTATTG 


GCAAGCACCA 


780 
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TGTTAACTCT 790 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 262 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Lys Gly Leu Trp Ser Asn Asn Leu Thr Cys Gly Tyr Asp Glu Lys lie 
15 10 15 

lie Leu Glu Asn lie Asn lie Lys lie Pro Glu Glu Lys lie Ser Val 
20 25 30 

lie lie Gly Ser Asn Gly Cys Gly Lys Ser Thr Leu lie Lys Thr Leu 
35 40 45 

Ser Arg Leu lie Lys Pro Leu Glu Gly Glu Val Leu Leu Asp Asn Lys 
50 55 60 

Ser lie Asn Ser Tyr Lys Glu Lys Asp Leu Ala Lys His lie Ala lie 
65 70 75 80 

Leu Pro Gin Ser Pro lie lie Pro Glu Ser lie Thr Val Ala Asp Leu 
85 90 95 

Val Ser Arg Gly Arg Phe Pro Tyr Arg Lys Pro Phe Lys Ser Leu Gly 
100 105 110 

Lys Asp Asp Leu Glu lie lie Asn Arg Ser Met Val Lys Ala Asn Val 
115 120 125 

Glu Asp Leu Ala Asn Asn Leu Val Glu Glu Leu Ser Gly Gly Gin Arg 
130 135 140 

Gin Arg Val Trp lie Ala Leu Ala Leu Ala Gin Asp Thr Ser lie Leu 
145 150 155 160 

Leu Leu Asp Glu Pro Thr Thr Tyr Leu Asp lie Ser Tyr Gin lie Glu 
165 170 175 

Leu Leu Asp Leu Leu Thr Asp Leu Asn Gin Lys Tyr Lys Thr Thr lie 
180 185 . 190 

Cys Met lie Leu His Asp lie Asn Leu Thr Ala Arg Tyr Ala Asp Tyr 
195 200 205 

Leu Phe Ala lie Lys Glu Gly Lys Leu Val Ala Glu Gly Lys Pro Glu 
210 215 220 

Asp lie Leu Asn Asp Lys Leu Val Lys Asp lie Phe Asn Leu Glu Ala 
225 230 235 240 

Lys lie lie Arg Asp Pro lie Ser Asn Ser Pro Leu Met lie Pro lie 
245 250 255 



Gly Lys His His Val Ser 
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260 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 781 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 



AAACTCAGAA 


AAGAAAGCAG 


ACAATGCAAC 


AACTATCAAA 


ATCGCAACTG 


TTAACCGTAG 


60 


CGGTTCTGAA 


GAAAAACGTT 


GGGACAAAAT 


CCAAGAATTG 


GTTAAAAAAG 


ACGGAATTAC 


12 0 


CTTGGAATTT 


ACAGAGTTCA 


CAGACTACTC 


ACAACCAAAC 


AAAGCAACTG 


CTGATGGCGA 


180 


AGTAGATTTG 


AACGCTTTCC 


AACACTATAA 


CTTCTTGAAC 


AACTGGAACA 


AAGAAAACGG 


240 


AAAAGACCTT 


GTAGCGATTG 


CAGATACTTA 


CATCTCTCCA 


ATCCGCCTTT 


ACTCAGGTTT 


300 


GAATGGAAGT 


GCCAACAAGT 


ACACTAAAGT 


AGAAGACATC 


CCAGCAAACG 


GAGAAATCGC 


360 


TGTACCGAAT 


GACGCTACAA 


ACGAAAGCCG 


TGCGCTTTAT 


TTGCTTCAAT 


CAGCTGGCTT 


420 


GATTAAATTG 


GATGTTTCTG 


GAACTGCTCT 


TGCAACAGTT 


GCCAACATCA 


AAGAAAATCC 


480 


AAAGAACTTG 


AAAATCACTG 


AATTGGACGC 


TAGCCAAACA 


GCTCGTTCAT 


TGTCATCAGT 


540 


TGACGCTGCC 


GTTGTAAACA 


ATACCTTCGT 


TACAGAAGCA 


AAATTGGACT 


ACAAGAAATC 


600 


ACTTTTCAAA 


GAACAAGCTG 


ATGAAAACTC 


AAAACAATGG 


TACAACATCA 


TTGTTGCAAA 


660 


AAAAGATTGG 


GAAACATCAC 


CTAAGGCTGA 


TGCTATCAAG 


AAAGTAATCG 


CAGCTTACCA 


720 


CACAGATGAC 


GTGAAAAAAG 


TTATCGAAGA 


ATCATCAGAT 


GGTTTGGATC 


AACCAGTTTG 


780 


G 












781 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 260 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Asn Ser Glu Lys Lys Ala Asp Asn Ala Thr Thr lie Lys lie Ala Thr 
15 10 15 

Val Asn Arg Ser Gly Ser Glu Glu Lys Arg Trp Asp Lys lie Gin Glu 
20 25 30 

Leu Val Lys Lys Asp Gly lie Thr Leu Glu Phe Thr Glu Phe Thr Asp 
35 40 45 
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Tyr Ser Gin Pro Asn Lys Ala Thr Ala Asp Gly Glu Val Asp Leu Asn 
50 55 60 

Ala Phe Gin His Tyr Asn Phe Leu Asn Asn Trp Asn Lys Glu Asn Gly 
65 70 75 80 

Lys Asp Leu Val Ala lie Ala Asp Thr Tyr lie Ser Pro lie Arg Leu 
85 90 95 

Tyr Ser Gly Leu Asn Gly Ser Ala Asn Lys Tyr Thr Lys Val Glu Asp 
100 105 110 

lie Pro Ala Asn Gly Glu lie Ala Val Pro Asn Asp Ala Thr Asn Glu 
115 120 125 

Ser Arg Ala Leu Tyr Leu Leu Gin Ser Ala Gly Leu lie Lys Leu Asp 
130 135 140 

Val Ser Gly Thr Ala Leu Ala Thr Val Ala Asn lie Lys Glu Asn Pro 
145 150 155 160 

Lys Asn Leu Lys lie Thr Glu Leu Asp Ala Ser Gin Thr Ala Arg Ser 
165 170 175 

Leu Ser Ser Val Asp Ala Ala Val Val Asn Asn Thr Phe Val Thr Glu 
180 185 190 

Ala Lys Leu Asp Tyr Lys Lys Ser Leu Phe Lys Glu Gin Ala Asp Glu 
195 200 205 

Asn Ser Lys Gin Trp Tyr Asn lie lie Val Ala Lys Lys Asp Trp Glu 
210 215 220 

Thr Ser Pro Lys Ala Asp Ala lie Lys Lys Val lie Ala Ala Tyr His 
225 230 235 240 

Thr Asp Asp Val Lys Lys Val lie Glu Glu Ser Ser Asp Gly Leu Asp 
245 250 255 

Gin Pro Val Trp 
260 

(2) INFORMATION FOR SEQ ID NO : 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 640 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

TTCGAAAGGG TCAGAAGGTG CAGACCTTAT CAGCATGAAA GGGGATGTCA TTACAGAACA 60 

TCAATTTTAT GAGCAAGTGA AAAGCAACCC TTCAGCCCAA CAAGTCTTGT TAAATATGAC 120 

CATCCAAAAA GTTTTTGAAA AACAATATGG CTCAGAGCTT GATGATAAAG AGGTTGATGA 180 

TACTATTGCC GAAGAAAAAA AACAATATGG CGAAAACTAC CAACGTGTCT TGTCACAAGC 240 

AGGTATGACT CTTGAAACAC GTAAAGCTCA AATTCGTACA AGTAAATTAG TTGAGTTGGC 300 
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AGTTAAGAAG GTAGCAGAAG CTGAATTGAC AGATGAAGCC TATAAGAAAG CCTTTGATGA 360 

GTACACTCCA GATGTAACGG CTCAAATCAT CCGTCTTAAT AATGAAGATA AGGCCAAAGA 420 

AGTTCTCGAA AAAGCCAAGG CAGAAGGTGC TGATTTTGCT CAATTAGCCA AAGATAATTC 480 

AACTGATGAA AAAACAAAAG AAAATGGTGG AGAAATTACC TTTGATTCTG CTTCAACAGA 540 

AGTACCTGGA GCAAGTCCAA AAAAGCCGCT TTTCGCTTTT AGATGTGGGA TGGTGTTTCT 600 

GGATGTGGAT TACAGCAACT GGGGCACACC AAGCCTACAG 640 
(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 213 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
{ D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Ser Lys Gly Ser Glu Gly Ala Asp Leu lie Ser Met Lys Gly Asp Val 
1 5 10 15 

lie Thr Glu His Gin Phe Tyr Glu Gin Val Lys Ser Asn Pro Ser Ala 
20 25 30 

Gin Gin Val Leu Leu Asn Met Thr lie Gin Lys Val Phe Glu Lys Gin 
35 40 45 

Tyr Gly Ser Glu Leu Asp Asp Lys Glu Val Asp Asp Thr lie Ala Glu 
50 55 60 

Glu Lys Lys Gin Tyr Gly Glu Asn Tyr Gin Arg Val Leu Ser Gin Ala 
65 70 75 80 

Gly Met Thr Leu Glu Thr Arg Lys Ala Gin lie Arg Thr Ser Lys Leu 
85 90 95 

Val Glu Leu Ala Val Lys Lys Val Ala Glu Ala Glu Leu Thr Asp Glu 
100 105 110 

Ala Tyr Lys Lys Ala Phe Asp Glu Tyr Thr Pro Asp Val Thr Ala Gin 
115 120 125 

lie lie Arg Leu Asn Asn Glu Asp Lys Ala Lys Glu Val Leu Glu Lys 
130 135 140 

Ala Lys Ala Glu Gly Ala Asp Phe Ala Gin Leu Ala Lys Asp Asn Ser 
145 150 155 160 

Thr Asp Glu Lys Thr Lys Glu Asn Gly Gly Glu lie Thr Phe Asp Ser 
165 170 175 

Ala Ser Thr Glu Val Pro Gly Ala Ser Pro Lys Lys Pro Leu Phe Ala 
180 185 190 

Phe Arg Cys Gly Met Val Phe Leu Asp Val Asp Tyr Ser Asn Trp Gly 
195 200 205 
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Thr Pro Ser Leu Gin 
210 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 631 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

GGGGATGGCA GCTTTTAAAA ATCCTAACAA TCAATACAAA GCTATTACAA TTGCTCAAAC 60 

TCTAGGTGAT GATGCTTCTT CAGAGGAATT GGCTGGTAGA TATGGTTCTG CTGTTCAGTG 120 

TACAGAAGTG ACTGCCTCAA ACCTTTCAAC AGTTAAAACT AAAGCTACGG TTGTAGAAAA 180 

ACCACTGAAA GATTTTAGAG CGTCTACGTC TGATCAGTCT GGTTGGGTGG AATCTAATGG 240 

TAAATGGTAT TTCTATGAGT CTGGTGATGT GAAGACAGGT TGGGTGAAAA CAGATGGTAA 3 00 

ATGGTACTAT TTGAATGACT TAGGTGTCAT GCAGACTGGA TTTGTAAAAT TTTCTGGTAG 3 60 

CTGGTATTAC TTGAGCAATT CAGGTGCTAT GTTTACAGGC TGGGGAACAG ATGGTAGCAG 420 

ATGGTTCTAC TTTGACGGCT CAGGAGCTAT GAAGACAGGC TGGTACAAGG AAAATGGCAC 480 

TTGGTATTAC CTTGACGAAG CAGGTATCAT GAAGACAGGT TGGTTTAAAG TCGGACCACA 540 

CTGGTACTAT GCCTACGGTT CAGGAGCTTT GGCTGTGAGC ACAACAACAC CAGATGGTTA 600 

CCGTGTAAAT GGTAATGGTG AATGGGTAAA C 631 
(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 210 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Gly Met Ala Ala Phe Lys Asn Pro Asn Asn Gin Tyr Lys Ala lie Thr 
15 10 15 

lie Ala Gin Thr Leu Gly Asp Asp Ala Ser Ser Glu Glu Leu Ala Gly 
20 25 30 

Arg Tyr Gly Ser Ala Val Gin Cys Thr Glu Val Thr Ala Ser Asn Leu 
35 40 45 

Ser Thr Val Lys Thr Lys Ala Thr Val Val Glu Lys Pro Leu Lys Asp 
50 55 60 

Phe Arg Ala Ser Thr Ser Asp Gin Ser Gly Trp Val Glu Ser Asn Gly 
65 70 75 80 



37 



Lys Trp Tyr Phe'Tyr Glu Ser Gly Asp Val Lys Thr Gly Trp Val Lys 
85 90 95 

Thr Asp Gly Lys Trp Tyr Tyr Leu Asn Asp Leu Gly Val Met Gin Thr 
100 105 110 

Gly Phe Val Lys Phe Ser Gly Ser Trp Tyr Tyr Leu Ser Asn Ser Gly 
115 120 125 

Ala Met Phe Thr Gly Trp Gly Thr Asp Gly Ser Arg Trp Phe Tyr Phe 
130 135 140 

Asp Gly Ser Gly Ala Met Lys Thr Gly Trp Tyr Lys Glu Asn Gly Thr 
145 150 155 160 

Trp Tyr Tyr Leu Asp Glu Ala Gly lie Met Lys Thr Gly Trp Phe Lys 
165 170 175 

Val Gly Pro His Trp Tyr Tyr Ala Tyr Gly Ser Gly Ala Leu Ala Val 
180 185 190 

Ser Thr Thr Thr Pro Asp Gly Tyr Arg Val Asn Gly Asn Gly Glu Trp 
195 200 205 

Val Asn 
210 

(2) INFORMATION FOR SEQ ID NO: 37: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1360 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 



AGACGAGCAA 


AAAATTAAGC 


AAGCAGAAGC 


GGAAGTTGAG 


AGTAAACAAG 


CTGAGGCTAC 


60 


AAGGTTAAAA 


AAAATCAAGA 


CAGATCGTGA 


AGAAGCAGAA 


GAAGAAGCTA 


AACGAAGAGC 


120 


AGATGCTAAA 


GAGCAAGGTA 


AACCAAAGGG 


GCGGGCAAAA 


CGAGGAGTTC 


CTGGAGAGCT 


180 


AGCAACACCT 


GATAAAAAAG 


AAAATGATGC 


GAAGTCTTCA 


GATTCTAGCG 


TAGGTGAAGA 


240 


AACTCTTCCA 


AGCCCATCCC 


TGAAACCAGA 


AAAAAAGGTA 


GCAGAAGCTG 


AGAAGAAGGT 


300 


TGAAGAAGCT 


AAGAAAAAAG 


CCGAGGATCA 


AAAAGAAGAA 


GATCGCCGTA 


ACTACCCAAC 


360 


CAATACTTAC 


AAAACGCTTG 


AACTTGAAAT 


TGCTGAGTCC 


GATGTGGAAG 


TTAAAAAAGC 


420 


GGAGCTTGAA 


CTAGTAAAAG 


AGGAAGCTAA 


GGAACCTCGA 


AACGAGGAAA 


AAGTTAAGCA 


480 


AGCAAAAGCG 


GAAGTTGAGA 


GTAAAAAAGC 


TGAGGCTACA 


AGGTTAGAAA 


AAATCAAGAC 


540 


AGATCGTAAA 


AAAGCAGAAG 


AAGAAGCTAA 


ACGAAAAGCA 


GCAGAAGAAG 


ATAAAGTTAA 


600 


AGAAAAACCA 


GCTGAACAAC 


CACAACCAGC 


GCCGGCTCCA 


AAAGCAGAAA 


AACCAGCTCC 


660 


AGCTCCAAAA 


CCAGAGAATC 


CAGCTGAACA 


ACCAAAAGCA 


GAAAAACCAG 


CTGATCAACA 


720 
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AGCTGAAGAA GACTATGCTC GTAGATCAGA AGAAGAATAT AATCGCTTGA CTCAACAGCA 780 

ACCGCCAAAA ACTGAAAAAC CAGCACAACC ATCTACTCCA AAAACAGGCT GGAAACAAGA 840 

AAACGGTATG TGGTACTTCT ACAATACTGA TGGTTCAATG GCGACAGGAT GGCTCCAAAA 900 

CAATGGCTCA TGGTACTACC TCAACAGCAA TGGCGCTATG GCGACAGGAT GGCTCCAAAA 960 

CAATGGTTCA TGGTACTATC TAAACGCTAA TGGTTCAATG GCAACAGGAT GGCTCCAAAA 1020 

CAATGGTTCA TGGTACTACC TAAACGCTAA TGGTTCAATG GCGACAGGAT GGCTCCAATA 1080 

CAATGGCTCA TGGTACTACC TAAACGCTAA TGGTTCAATG GCGACAGGAT GGCTCCAATA 1140 

CAATGGCTCA TGGTACTACC TAAACGCTAA TGGTGATATG GCGACAGGTT GGGTGAAAGA 1200 

TGGAGATACC TGGTACTATC TTGAAGCATC AGGTGCTATG AAAGCAAGCC AATGGTTCAA 1260 

AGTATCAGAT AAATGGTACT ATGTCAATGG CTCAGGTGCC CTTGCAGTCA ACACAACTGT 1320 

AGATGGCTAT GGAGTCAATG CCAATGGTGA ATGGGTAAAC 1360 
(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 453 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Asp Glu Gin Lys lie Lys Gin Ala Glu Ala Glu Val Glu Ser Lys Gin 
15 10 15 

Ala Glu Ala Thr Arg Leu Lys Lys lie Lys Thr Asp Arg Glu Glu Ala 
20 25 30 

Glu Glu Glu Ala Lys Arg Arg Ala Asp Ala Lys Glu Gin Gly Lys Pro 
35 40 45 

Lys Gly Arg Ala Lys Arg Gly Val Pro Gly Glu Leu Ala Thr Pro Asp 
50 55 60 

Lys Lys Glu Asn Asp Ala Lys Ser Ser Asp Ser Ser Val Gly Glu Glu 
65 70 75 80 

Thr Leu Pro Ser Pro Ser Leu Lys Pro Glu Lys Lys Val Ala Glu Ala 
85 90 95 

Glu Lys Lys Val Glu Glu Ala Lys Lys Lys Ala Glu Asp Gin Lys Glu 
100 105 110 

Glu Asp Arg Arg Asn Tyr Pro Thr Asn Thr Tyr Lys Thr Leu Glu Leu 
115 120 125 

Glu lie Ala Glu Ser Asp Val Glu Val Lys Lys Ala Glu Leu Glu Leu 
130 135 140 

Val Lys Glu Glu Ala Lys Glu Pro Arg Asn Glu Glu Lys Val Lys Gin 
145 150 155 160 
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Ala Lys Ala Glu Val Glu Ser Lys Lys Ala Glu Ala Thr Arg Leu Glu 
165 170 175 

Lys lie Lys Thr Asp Arg Lys Lys Ala Glu Glu Glu Ala Lys Arg Lys 
180 185 190 

Ala Ala Glu Glu Asp Lys Val Lys Glu Lys Pro Ala Glu Gin Pro Gin 
195 200 205 

Pro Ala Pro Ala Pro Lys Ala Glu Lys Pro Ala Pro Ala Pro Lys Pro 
210 215 220 

Glu Asn Pro Ala Glu Gin Pro Lys Ala Glu Lys Pro Ala Asp Gin Gin 
225 230 235 240 

Ala Glu Glu Asp Tyr Ala Arg Arg Ser Glu Glu Glu Tyr Asn Arg Leu 
245 250 255 

Thr Gin Gin Gin Pro Pro Lys Thr Glu Lys Pro Ala Gin Pro Ser Thr 
260 265 270 

Pro Lys Thr Gly Trp Lys Gin Glu Asn Gly Met Trp Tyr Phe Tyr Asn 
275 280 285 

Thr Asp Gly Ser Met Ala Thr Gly Trp Leu Gin Asn Asn Gly Ser Trp 
290 295 300 

Tyr Tyr Leu Asn Ser Asn Gly Ala Met Ala Thr Gly Trp Leu Gin Asn 
305 310 315 320 

Asn Gly Ser Trp Tyr Tyr Leu Asn Ala Asn Gly Ser Met Ala Thr Gly 
325 330 335 

Trp Leu Gin Asn Asn Gly Ser Trp Tyr Tyr Leu Asn Ala Asn Gly Ser 
340 345 350 

Met Ala Thr Gly Trp Leu Gin Tyr Asn Gly Ser Trp Tyr Tyr Leu Asn 
355 360 365 

Ala Asn Gly Ser Met Ala Thr Gly Trp Leu Gin Tyr Asn Gly Ser Trp 
370 375 380 

Tyr Tyr Leu Asn Ala Asn Gly Asp Met Ala Thr Gly Trp Val Lys Asp 
385 390 395 400 

Gly Asp Thr Trp Tyr Tyr Leu Glu Ala Ser Gly Ala Met Lys Ala Ser 
405 410 415 

Gin Trp Phe Lys Val Ser Asp Lys Trp Tyr Tyr Val Asn Gly Ser Gly 
420 425 430 

Ala Leu Ala Val Asn Thr Thr Val Asp Gly Tyr Gly Val Asn Ala Asn 
435 440 445 

Gly Glu Trp Val Asn 
450 

INFORMATION FOR SEQ ID NO : 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 412 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 



40 




(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

CTGTGGTGAG GAAGAAACTA AAAAGACTCA AGCAGCACAA CAGCCAAAAC AACAAACGAC 60 

TGTACAACAA ATTGCTGTTG GAAAAGATGC TCCAGACTTC ACATTGCAAT CCATGGATGG 120 

CAAAGAAGTT AAGTTATCTG ATTTTAAGGG TAAAAAGGTT TACTTGAAGT TTTGGGCTTC 180 

ATGGTGTGGT CCATGCAAGA AAAGTATGCC AGAGTTGATG GAACTAGCGG CGAAACCAGA 240 

TCGTGATTTC GAAATTCTTA CTGTCATTGC ACCAGGAATT CAAGGTGAAA AAACTGTTGA 300 

GCAATTGCCA CAATGGTTCC AGGAACAAGG ATATAAGGAT ATCCCAGTTC TTTATGATAC 360 

CAAAGCAACC ACTTCCAAGC TTATCAAATT CGAAGCATTC CTACAGAATA TT 412 
(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 137 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

Cys Gly Glu Glu Glu Thr Lys Lys Thr Gin Ala Ala Gin Gin Pro Lys 
15 10 15 

Gin Gin Thr Thr Val Gin Gin lie Ala Val Gly Lys Asp Ala Pro Asp 
20 25 30 

Phe Thr Leu Gin Ser Met Asp Gly Lys Glu Val Lys Leu Ser Asp Phe 
35 40 45 

Lys Gly Lys Lys Val Tyr Leu Lys Phe Trp Ala Ser Trp Cys Gly Pro 
50 55 60 

Cys Lys Lys Ser Met Pro Glu Leu Met Glu Leu Ala Ala Lys Pro Asp 
65 70 75 80 

Arg Asp Phe Glu lie Leu Thr Val lie Ala Pro Gly lie Gin Gly Glu 
85 90 95 

Lys Thr Val Glu Gin Phe Pro Gin Trp Phe Gin Glu Gin Gly Tyr Lys 
100 105 110 

Asp lie Pro Val Leu Tyr Asp Thr Lys Ala Thr Thr Ser Lys Leu lie 
115 120 125 

Lys Phe Glu Ala Phe Leu Gin Asn lie 
130 135 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1462 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 



GACTTTTAAC 


AATAAAACTA 


TTGAAGAGTT 


GCACAATCTC 


CTTGTCTCTA 


AGGAAATTTC 


60 


TGCAACAGAA 


TTGACCCAAG 


CAACACTTGA 


AAATATCAAG 


TCTCGTGAGG 


TV TV /^f^f~^/~^rT\f~^'7\. TV 

AAGCCCTCAA 


120 


TTCATTTGTC 


ACCATCGCTG 


AGGAGCAAGC 


TCTTGTTCAA 


GCTAAAGCCA 


mm^^ TV tv tv 

TTGATGAAGC 


180 


tGGAATTGAT 


GCTGACAATG 


TCCTTTCAGG 


AATTCCACTT 


GCTGTTAAGG 


TV m TV TV TV rn/^m/^ 

ATAACATCTC 


240 


TACAGACGGT 


ATTCTCACAA 


CTGCTGCCTC 


AAAAATGCTC 


TACAACTATG 


TV /^/^/^TV TV m/^mm 

AGCCAATCTT 


o n rv 

300 


TGATGCGACa 


gCTgTTGCCA 


ATGCAAAAAC 


CAAGGGCATG 


ATTGTCGTTG 


TV TV TV TV /~t TV TV 

GAAAGACCAA 


"5 ^ A 

360 


CATGGACGAA 


TTTGCTATGG 


GTGGTTCAGG 


tGAAACTTCA 


CACTACGGAG 


TV TV m TV TV TV TV TV 

CAACTAAAAA 


/I O A 

420 


CGCTTGGAAC 


CACAGCAAGG 


TTCCTGGTGG 


GTCATCAAGT 


GGTTCTGCCG 


CAGCTGTAGC 


4oO 


CTCAGGACAA 


GTTCGCTTGT 


CACTTGGTTC 


TGATACTGGT 


GGTTCCATCC 


GCCAACCTGC 


d40 


TGCCTTCAAC 


GGAATCGTTG 


GTCTCAAACC 


AACCTACGGA 


ACAGTTTCAC 


GTTTCGGTCT 


ri A 

600 


CATTGCCTTT 


GGTAGCTCAT 


TAGACCAGAT 


TGGACCTTTT 


GCTCCTACTG 


iT^rmv TV ^^^*^TV TV TV TV 

TTAAGGAAAA 


660 


TGCCCTCTTG 


CTCAACGCTA 


TTGCCAGCGA 


AGATGCTAAA 


GACTCTACTT 


CTGCTCCTGT 


720 


CCGCATCGCC 


GACTTTACTT 


CAAAAATCGG 


CCAAGACATC 


AAGGGTATGA 


AAATCGCTTT 


780 


GCCTAAGGAA 


TACCTAGGCG 


AAGGAATTGA 


TCCAGAGGTT 


AAGGAAACAA 


TCTTAAACGC 


840 


GGCCAAACAC 


TTTGAAAAAT 


TGGGTGCTAT 


CGTCGAAGAA 


GTCAGCCTTC 


CTCACTCTAA 


900 


ATACGGTGTT 


GCCGTTTATT 


ACATCATCGC 


TTCATCAGAA 


GCTTCATCAA 


ACTTGCAACG 


960 


CTTCGACGGT 


ATCCGTTACG 


GCTATCGCGC 


AGAAGATGCA 


ACCAACCTTG 


TV TV TV TVfm^^fTlTV 

ATGAAATC T A 


1 A A 

1020 


TGTAAACAGC 


CGAAGCCAAG 


GTTTTGGTGA 


AGAGGTAAAA 


CGTCGTATCA 


TGCTGGGTAC 


1080 


TTTCAGTCTT 


TCATCAGGTT 


ACTATGATGC 


CTACTACAAA 


AAGGCTGGTC 


AAGTCCGTAC 


1140 


CCTCATCATT 


CAAGATTTCG 


AAAAAGTCTT 


CGCGGATTAC 


GATTTGATTT 


TGGGTCCAAC 


1200 


TGCTCCAAGT 


GTTGCCTATG 


ACTTGGATTC 


TCTCAACCAT 


GACCCAGTTG 


CCATGTACTT 


1260 


AGCCGACCTA 


TTGACCATAC 


CTGTAAACTT 


GGCAGGACTG 


CCTGGAATTT 


CGATTCCTGC 


1320 


TGGATTCTCT 


CAAGGTCTAC 


CTGTCGGACT 


CCAATTGATT 


GGTCCCAAGT 


ACTCTGAGGA 


1380 


AACCATTTAC 


CAAGCTGCTG 


CTGCTTTTGA 


AGCAACAACA 


GACTACCACA 


AACAACAACC 


1440 


CGTGATTTTT 


GGAGGTGACA 


AC 








1462 



(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 487 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 

Thr Phe Asn Asn Lys Thr lie Glu Glu Leu His Asn Leu Leu Val Ser 
1 5 10 15 

Lys Glu lie Ser Ala Thr Glu Leu Thr Gin Ala Thr Leu Glu Asn lie 
20 25 30 

Lys Ser Arg Glu Glu Ala Leu Asn Ser Phe Val Thr lie Ala Glu Glu 
35 40 45 

Gin Ala Leu Val Gin Ala Lys Ala He Asp Glu Ala Gly He Asp Ala 
50 55 60 

Asp Asn Val Leu Ser Gly He Pro Leu Ala Val Lys Asp Asn He Ser 
65 70 75 80 

Thr Asp Gly He Leu Thr Thr Ala Ala Ser Lys Met Leu Tyr Asn Tyr 
85 90 95 

Glu Pro He Phe Asp Ala Thr Ala Val Ala Asn Ala Lys Thr Lys Gly 
100 105 110 

Met He Val Val Gly Lys Thr Asn Met Asp Glu Phe Ala Met Gly Gly 
115 120 125 

Ser Gly Glu Thr Ser His Tyr Gly Ala Thr Lys Asn Ala Trp Asn His 
130 135 140 

Ser Lys Val Pro Gly Gly Ser Ser Ser Gly Ser Ala Ala Ala Val Ala 
145 150 155 160 

Ser Gly Gin Val Arg Leu Ser Leu Gly Ser Asp Thr Gly Gly Ser He 
165 i70 175 

Arg Gin Pro Ala Ala Phe Asn Gly He Val Gly Leu Lys Pro Thr Tyr 
180 185 190 

Gly Thr Val Ser Arg Phe Gly Leu He Ala Phe Gly Ser Ser Leu Asp 
195 200 205 

Gin He Gly Pro Phe Ala Pro Thr Val Lys Glu Asn Ala Leu Leu Leu 
210 215 220 

Asn Ala He Ala Ser Glu Asp Ala Lys Asp Ser Thr Ser Ala Pro Val 
225 230 235 240 

Arg He Ala Asp Phe Thr Ser Lys He Gly Gin Asp He Lys Gly Met 
245 250 255 

Lys He Ala Leu Pro Lys Glu Tyr Leu Gly Glu Gly He Asp Pro Glu 
260 * 265 270 

Val Lys Glu Thr He Leu Asn Ala Ala Lys His Phe Glu Lys Leu Gly 
275 280 285 

Ala He Val Glu Glu Val Ser Leu Pro His Ser Lys Tyr Gly Val Ala 
290 295 300 
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Val Tyr Tyr lie lie Ala Ser Ser Glu Ala Ser Ser Asn Leu Gin Arg 
305 310 315 320 

Phe Asp Gly lie Arg Tyr Gly Tyr Arg Ala Glu Asp Ala Thr Asn Leu 
325 330 335 

Asp Glu He Tyr Val Asn Ser Arg Ser Gin Gly Phe Gly Glu Glu Val 
340 345 350 

Lys Arg Arg He Met Leu Gly Thr Phe Ser Leu Ser Ser Gly Tyr Tyr 
355 360 365 

Asp Ala Tyr Tyr Lys Lys Ala Gly Gin Val Arg Thr Leu He He Gin 
370 375 380 

Asp Phe Glu Lys Val Phe Ala Asp Tyr Asp Leu He Leu Gly Pro Thr 
385 390 395 400 

Ala Pro Ser Val Ala Tyr Asp Leu Asp Ser Leu Asn His Asp Pro Val 
405 410 415 

Ala Met Tyr Leu Ala Asp Leu Leu Thr He Pro Val Asn Leu Ala Gly 
420 425 430 

Leu Pro Gly He Ser He Pro Ala Gly Phe Ser Gin Gly Leu Pro Val 
435 440 445 

Gly Leu Gin Leu He Gly Pro Lys Tyr Ser Glu Glu Thr He Tyr Gin 
450 455 460 

Ala Ala Ala Ala Phe Glu Ala Thr Thr Asp Tyr His Lys Gin Gin Pro 
465 470 475 480 

Val He Phe Gly Gly Asp Asn 
485 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 433 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

CTTTACAGGT AAACAACTAC AAGTCGGCGA CAAGGCGCTT GATTTTTCTC TTACTACAAC 60 

AGATCTTTCT AAAAAATCTC TGGCTGATTT TGATGGCAAG AAAAAAGTCT TGAGTGTCGT 120 

TCCTTCTATC GATACAGGCA TCTGCTCAAC TCAAACACGT CGTTTTAATG AAGAATTGGC 180 

TGGACTGGAC AACACGGTCG TATTGACTGT TTCAATGGAC CTACCTTTTG CTCAAAAACG 240 

TTGGTGCGGT GCTGAAGGCC TTGACAATGC CATTATGCTT TCAGACTACT TTGACCATTC 300 

TTTCGGGCGC GATTATGCCC TCTTGATCAA CGAATGGCAC CTATTAGCAC GCGCAGTCTT 360 

TGTCCTCGAT ACTGACAATA CGATTCGCTA CGTTGAATAC GTGGATAATA TCAATTCTGA 420 

GCCAAACTTC GAA 433 
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(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 144 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

Phe Thr Gly Lys Gin Leu Gin Val Gly Asp Lys Ala Leu Asp Phe Ser 
1 5 10 15 

Leu Thr Thr Thr Asp Leu Ser Lys Lys Ser Leu Ala Asp Phe Asp Gly 
20 25 30 

Lys Lys Lys Val Leu Ser Val Val Pro Ser lie Asp Thr Gly lie Cys 
35 40 45 

Ser Thr Gin Thr Arg Arg Phe Asn Glu Glu Leu Ala Gly Leu Asp Asn 
50 55 60 

Thr Val Val Leu Thr Val Ser Met Asp Leu Pro Phe Ala Gin Lys Arg 
65 70 . 75 80 

Trp Cys Gly Ala Glu Gly Leu Asp Asn Ala lie Met Leu Ser Asp Tyr 
85 90 95 

Phe Asp His Ser Phe Gly Arg Asp Tyr Ala Leu Leu lie Asn Glu Trp 
100 105 110 

His Leu Leu Ala Arg Ala Val Phe Val Leu Asp Thr Asp Asn Thr lie 
115 120 125 

Arg Tyr Val Glu Tyr Val Asp Asn lie Asn Ser Glu Pro Asn Phe Glu 
130 135 140 

(2) INFORMATION FOR SEQ ID NO : 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 724 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 



CCAGGCTGAT 


ACAAGTATCG 


CAGACATTCA 


AAAAAGAGGC 


GAACTGGTTG 


TCGGTGTCAA 


60 


ACAAGACGTT 


CCCAATTTTG 


GTTACAAnGA 


TCCCAAGACC 


GGTACTTATT 


CTGGTATCGA 


120 


AaCCGACTTG 


GCCAAGATGG 


TAGCTGATGA 


ACTCAAGGTC 


AAGATTCGCT 


ATGTGCCGGT 


180 


TACAGCACAA 


ACCCGCGGCC 


CCCTTCTAGA 


CAATGAACAG 


GTCGATATGG 


ATATCGCGAC 


240 


CTTTACCATC 


ACGGACGAAC 


GCAAAAAACT 


CTACAACTTT 


ACCAGTCCCT 


ACTACACAGA 


300 


CGCTTCTGGA 


TTTTTGGTCA 


ATAAATCTGC 


CAAAATCAAA 


AAGATTGAGG 


ACCTAAACGG 


360 
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CAAAACCATC GGAGTCGCCC AAGGTTCTAT CACCCAACGC CTGATTACTG AACTGGGTAA 420 

AAAGAAAGGT CTGAAGTTTA AATTCGTCGA ACTTGGTTCC TACCCAGAAT TGATTACTTC 480 

CCTGCACGCT CATCGTATCG ATACCTTTTC CGTTGACCGC TCTATTCTAT CTGGCTACAC 540 

TAGTAAACGG ACAGCACTAC TAGATGATAG TTTCAAGCCA TCTGACTACG GTATTGTTAC 600 

CAAGAAATCA AATACAGAGC TCAACGACTA TCTTGATAAC TTGGTTACTA AATGGAGCAA 660 

GGATGGTAGT TTGCAGAAAC TTTATGACCG TTACAAGCTC AAACCATCTA GCCATACTGC 720 

AGAT 724 
(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 241 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

Gin Ala Asp Thr Ser lie Ala Asp lie Gin Lys Arg Gly Glu Leu Val 
15 10 15 

Val Gly Val Lys Gin Asp Val Pro Asn Phe Gly Tyr Xaa Asp Pro Lys 
20 25 30 

Thr Gly Thr Tyr Ser Gly lie Glu Thr Asp Leu Ala Lys Met Val Ala 
35 40 45 

Asp Glu Leu Lys Val Lys lie Arg Tyr Val Pro Val Thr Ala Gin Thr 
50 55 60 

Arg Gly Pro Leu Leu Asp Asn Glu Gin Val Asp Met Asp lie Ala Thr 
65 70 75 80 

Phe Thr lie Thr Asp Glu Arg Lys Lys Leu Tyr Asn Phe Thr Ser Pro 
85 90 95 

Tyr Tyr Thr Asp Ala Ser Gly Phe Leu Val Asn Lys Ser Ala Lys lie 
100 105 110 

Lys Lys lie Glu Asp Leu Asn Gly Lys Thr lie Gly Val Ala Gin Gly 
115 120 125 

Ser lie Thr Gin Arg Leu lie Thr Glu Leu Gly Lys Lys Lys Gly Leu 
130 135 140 

Lys Phe Lys Phe Val Glu Leu Gly Ser Tyr Pro Glu Leu lie Thr Ser 
145 150 155 160 

Leu His Ala His Arg lie Asp Thr Phe Ser Val Asp Arg Ser lie Leu 
165 170 175 

Ser Gly Tyr Thr Ser Lys Arg Thr Ala Leu Leu Asp Asp Ser Phe Lys 
180 185 190 

Pro Ser Asp Tyr Gly lie Val Thr Lys Lys Ser Asn Thr Glu Leu Asn 
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195 200 205 

Asp Tyr Leu Asp Asn Leu Val Thr Lys Trp Ser Lys Asp Gly Ser Leu 
210 215 220 

Gin Lys Leu Tyr Asp Arg Tyr Lys Leu Lys Pro Ser Ser His Thr Ala 
225 230 235 240 

Asp 

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1279 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 



GTCTGTATCA 


TTTGAAAACA 


AAGAAACAAA 


CCGTGGTGTC 


TTgACTTTCA 


CTATCTCTCA 


60 


AGACCAAATC 


AAACCAGAAT 


TGGACCGTGT 


CTTCAAGtCA 


GTGAAGAAAT 


CTCTTAATGT 


120 


TCCAGGTTTC 


CGTAAAGGTC 


ACCTTCCACG 


CCCTATCTTC 


GACCAAAAAT 


TTGGTGAAGA 


180 


AGCTCTTTAT 


CAAGATGCAA 


TGAACGCACT 


TTTGCCAAAC 


GCTTATGAAG 


CAGCTGTAAA 


240 


AGAAGCTGGT 


CTTGAAGTGG 


TTGCCCAACC 


AAAAATTGAC 


GTAACTTCAA 


TGGAAAAAGG 


300 


TCAAGACTGG 


GTTATCACTG 


CTGAAGTCGT 


TACAAAACCT 


GAAGTAAAAT 


TGGGTGACTA 


360 


CAAAAACCTT 


GAAGTATCAG 


TTGATGTAGA 


AAAAGAAGTA 


ACTGACGCTG 


ATGTCGAAGA 


420 


GCGTATCGAA 


CGCGAACGCA 


ACAACCTGGC 


TGAATTGGTT 


ATCAAGGAAG 


CTGCTGCTGA 


480 


AAACGGCGAC 


ACTGTTGTGA 


TCGACTTCGT 


TGGTTCTATC 


GACGGTGTTG 


AATTTGACGG 


540 


TGGAAAAGGT 


GAAAACTTCT 


CACTTGGACT 


TGGTTCAGGT 


CAATTCATCC 


CTGGTTTCGA 


600 


AGACCAATTG 


GTAGGTCACT 


CAGCTGGCGA 


AACCGTTGAT 


GTTATCGTAA 


CATTCCCAGA 


660 


AGACTACCAA 


GCAGAAGACC 


TTGCAGGTAA 


AGAAGCTAAA 


TTCGTGACAA 


CTATCCACGA 


720 


AGTAAAAGCT 


AAAGAAGTTC 


CGGCTCTTGA 


CGATGAACTT 


GCAAAAGACA 


TTGATGAAGA 


780 


AGTTGAAACA 


CTTGCTGACT 


TGAAAGAAAA 


ATACAGCAAA 


GAATTGGCTG 


CTGCTAAAGA 


840 


AGAAGCTTAC 


AAAGATGCAG 


TTGAAGGTGC 


AGCAATTGAT 


ACAGCTGTAG 


AAAATGCTGA 


900 


AATCGTAGAA 


CTTCCAGAAG 


AAATGATCCA 


TGAAGAAGTT 


CACCGTTCAG 


TAAATGAATT 


960 


CCTTGGGAAT 


TTGCAACGTC 


AAGGGATCAA 


CCCTGACATG 


TACTTCCAAA 


TCACTGGAAC 


1020 


TACTCAAGAA 


GACCTTCACA 


ACCAATACCA 


AGCAGAAGCT 


GAGTCACGTA 


CTAAGACTAA 


1080 


CCTTGTTATC 


GAAGCAGTTG 


CCAAAGCTGA 


AGGATTTGAT 


GCTTCAGAAG 


AAGAAATCCA 


1140 


AAAAGAAGTT 


GAGCAATTGG 


CAGCAGACTA 


CAACATGGAA 


GTTGCACAAG 


TTCAAAACTT 


1200 


GCTTTCAGCT 


GACATGTTGA 


AACATGATAT 


CACTATCAAA 


AAAGCTGTTG 


AATTGATCAC 


1260 
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AAGCACAGCA ACAGTAAAA 1279 
(2) INFORMATION. FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 426 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: 

Ser Val Ser Phe Glu Asn Lys Glu Thr Asn Arg Gly Val Leu Thr Phe 
1 5 10 15 

Thr lie Ser Gin Asp Gin lie Lys Pro Glu Leu Asp Arg Val Phe Lys 
20 25 30 

Ser Val Lys Lys Ser Leu Asn Val Pro Gly Phe Arg Lys Gly His Leu 
35 40 45 

Pro Arg Pro lie Phe Asp Gin Lys Phe Gly Glu Glu Ala Leu Tyr Gin 
50 55 60 

Asp Ala Met Asn Ala Leu Leu Pro Asn Ala Tyr Glu Ala Ala Val Lys 
65 70 75 80 

Glu Ala Gly Leu Glu Val Val Ala Gin Pro Lys lie Asp Val Thr Ser 
85 90 95 

Met Glu Lys Gly Gin Asp Trp Val He Thr Ala Glu Val Val Thr Lys 
100 105 110 

Pro Glu Val Lys Leu Gly Asp Tyr Lys Asn Leu Glu Val Ser Val Asp 
115 120 125 

Val Glu Lys Glu Val Thr Asp Ala Asp Val Glu Glu Arg He Glu Arg 
130 135 140 

Glu Arg Asn Asn Leu Ala Glu Leu Val He Lys Glu Ala Ala Ala Glu 
145 150 155 160 

Asn Gly Asp Thr Val Val He Asp Phe Val Gly Ser He Asp Gly Val 
165 170 175 

Glu Phe Asp Gly Gly Lys Gly Glu Asn Phe Ser Leu Gly Leu Gly Ser 
180 185 190 

Gly Gin Phe He Pro Gly Phe Glu Asp Gin Leu Val Gly His Ser Ala 
195 200 205 

Gly Glu Thr Val Asp Val He Val Thr Phe Pro Glu Asp Tyr Gin Ala 
210 215 220 

Glu Asp Leu Ala Gly Lys Glu Ala Lys Phe Val Thr Thr He His Glu 
225 230 235 240 

Val Lys Ala Lys Glu Val Pro Ala Leu Asp Asp Glu Leu Ala Lys Asp 
245 250 255 
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He Asp Glu Glu Val Glu Thr Leu Ala Asp Leu Lys Glu Lys Tyr Ser 
260 265 270 

Lys Glu Leu Ala Ala Ala Lys Glu Glu Ala Tyr Lys Asp Ala Val Glu 
275 280 285 

Gly Ala Ala He Asp Thr Ala Val Glu Asn Ala Glu He Val Glu Leu 
290 295 300 

Pro Glu Glu Met He His Glu Glu Val His Arg Ser Val Asn Glu Phe 
305 310 315 320 

Leu Gly Asn Leu Gin Arg Gin Gly He Asn Pro Asp Met Tyr Phe Gin 
325 330 335 

He Thr Gly Thr Thr Gin Glu Asp Leu His Asn Gin Tyr Gin Ala Glu 
340 345 350 

Ala Glu Ser Arg Thr Lys Thr Asn Leu Val He Glu Ala Val Ala Lys 
355 360 365 

Ala Glu Gly Phe Asp Ala Ser Glu Glu Glu He Gin Lys Glu Val Glu 
370 375 380 

Gin Leu Ala Ala Asp Tyr Asn Met Glu Val Ala Gin Val Gin Asn Leu 
385 390 395 400 

Leu Ser Ala Asp Met Leu Lys His Asp He Thr He Lys Lys Ala Val 
405 410 415 

Glu Leu He Thr Ser Thr Ala Thr Val Lys 
420 425 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 490 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

TGGTCAAAAG GAAAGTCAGA CAGGAAAGGG GATGAAAATT GTGACCAGTT TTTATCCTAT 60 

CTACGCTATG GTTAAGGAAG TATCTGGTGA CTTGAATGAT GTTCGGATGA TTCAGTCAAG 120 

TAGTGGTATT CACTCCTTTG AACCTTCGGC AAATGATATC GCAGCCATCT ATGATGCAGA 180 

TGTCTTTGTT TACCATTCTC ATACACTCGA ATCTTGGGCA GGAAGTCTGG ATCCAAATCT 240 

AAAAAAATCC AAAGTGAAGG TCTTAGAGGC TTCTGAGGGA ATGACCTTGG AACGTGTCCC 300 

TGGACTAGAG GATGTGGAAG CAGGGGATGG AGTTGATGAA AAAACGCTCT ATGACCCTCA 360 

CACATGGCTA GATCCTGAAA AAGCTGGAGA AGAAGCCCAA ATTATCGCTG ATAAACTTTC 420 

AGAGGTGGAT AGTGAGCATA AAGAGACTTA TCAAAAAAAT GCGCAACCTT TATCAAAAAA 480 

GCTCAGGAAT 490 
(2) INFORMATION FOR SEQ ID NO: 50: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 163 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

Gly Gin Lys Glu Ser Gin Thr Gly Lys Gly Met Lys lie Val Thr Ser 
1 5 . 10 15 

Phe Tyr Pro lie Tyr Ala Met Val Lys Glu Val Ser Gly Asp Leu Asn 
20 25 30 

Asp Val Arg Met lie Gin Ser Ser Ser Gly lie His Ser Phe Glu Pro 
35 40 45 

Ser Ala Asn Asp lie Ala Ala lie Tyr Asp Ala Asp Val Phe Val Tyr 
50 55 60 

His Ser His Thr Leu Glu Ser Trp Ala Gly Ser Leu Asp Pro Asn Leu 
65 70 75 80 

Lys Lys Ser Lys Val Lys Val Leu Glu Ala Ser Glu Gly Met Thr Leu 
85 90 95 

Glu Arg Val Pro Gly Leu Glu Asp Val Glu Ala Gly Asp Gly Val Asp 
100 105 110 

Glu Lys Thr Leu Tyr Asp Pro His Thr Trp Leu Asp Pro Glu Lys Ala 
115 120 125 

Gly Glu Glu Ala Gin lie lie Ala Asp Lys. Leu Ser Glu Val Asp Ser 
130 135 140 

Glu His Lys Glu Thr Tyr Gin Lys Asn Ala Gin Pro Leu Ser Lys Lys 
145 150 155 160 

Leu Arg Asn 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1006 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

GAAGGATAGA TATATTTTAG CATTTGAGAC ATCCTGTGAT GAGACCAGTG TCGCCGTCTT 60 

GAAAAACGAC GATGAGCTCT TGTCCAATGT CATTGCTAGT CAAATTGAGA GTCACAAACG 120 

TTTTGGTGGC GTAGTGCCCG AAGTAGCCAG TCGTCACCAT GTCGAGGTCA TTACAGCCTG 180 

TATCGAGGAG GCATTGGCAG AAGCAGGGAT TACCGAAGAG GACGTGACAG CTGTTGCGGT 240 
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TACCTACGGA 


CCAGGCTTGG 


TCGGAGCCTT 


GCTAGTTGGT 


TTGTCAGCTG 


CCAAGGCCTT 


300 


TGCTTGGGCT 


CACGGACTTC 


CACTGATTCC 


TGTTAATCAC 


ATGGCTGGGC 


ACCTCATGGC 


360 


AGCTCAGAGT 


GTGGAGCCTT 


TGGAGTTTCC 


CTTGCTAGCC 


CTCTTGGTCA 


GCGGCGGACA 


420 


CACAGAGTTG 


GTTTATGTTT 


CGGAGGCAGG 


AGATTATAAG 


ATTGTTGGGG 


AAACCCGTGA 


480 


TGATGCGGTT 


GGTGAGGCTT 


ATGATAAGGT 


CGGCCGTGTC 


ATGGGCTTGA 


CCTATCCTGC 


540 


AGGTCGTGAG 


ATTGACGAGC 


TGGCTCATCA 


GGGGCAGGAT 


ATTTATGATT 


TCCCCCGTGC 


600 


CATGATTAAG 


GAAGATAATC 


TGGAGTTCTC 


CTTCTCAGGT 


TTGAAATCTG 


CCTTTATCAA 


660 


TCTTCATCAC 


AATGCCGAGC 


AAAAGGGAGA 


AAGCCTGTCT 


ACAGAAGATT 


TGTGTGCTTC 


720 


CTTCCAAGCA 


GCAGTTATGG 


ACATTCTCAT 


GGCAAAAACC 


AAGAAGGCTT 


TGGAGAAATA 


780 


TCCTGTTAAA 


ATCCTAGTTG 


TGGCAGGTGG 


TGTGGCAGCC 


AATAAAGGTC 


TCAGAGAACG 


840 


CCTAGCAGCC 


GAAATCACAG 


ATGTCAAGGT 


TATCATCCCC 


CCTCTGCGAC 


TCTGCGGAGA 


900 


CAATGCAGGT 


ATGATTGCCT 


ATGCCAGCGT 


CAGCNAGTGG 


AACAAAGAAA 


ACTTCGCAGG 


960 


CTGGGACCTC 


AATGCCAAAC 


CAAGTCTTGC 


CTTTGATACC 


ATGGAA 




1006 


(2) INFORMATION FOR SEQ ID NO: 52: 











(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 335 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Lys Asp Arg Tyr lie Leu Ala Phe Glu Thr Ser Cys Asp Glu Thr Ser 
15 10 15 

Val Ala Val Leu Lys Asn Asp Asp Glu Leu Leu Ser Asn Val lie Ala 
20 25 30 

Ser Gin lie Glu Ser His Lys Arg Phe Gly Gly Val Val Pro Glu Val 
35 40 45 

Ala Ser Arg His His Val Glu Val lie Thr Ala Cys lie Glu Glu Ala 
50 55 60 

Leu Ala Glu Ala Gly lie Thr Glu Glu Asp Val Thr Ala Val Ala Val 
65 70 75 80 

Thr Tyr Gly Pro Gly Leu Val Gly Ala Leu Leu Val Gly Leu Ser Ala 
85 90 95 

Ala Lys Ala Phe Ala Trp Ala His Gly Leu Pro Leu lie Pro Val Asn 
100 105 110 

His Met Ala Gly His Leu Met Ala Ala Gin Ser Val Glu Pro Leu Glu 
115 120 125 
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Phe Pro Leu Leu Ala Leu Leu Val Ser Gly GlyijHis Thr Glu Leu Val 
130 135 i|^0 

Tyr Val Ser Glu Ala Gly Asp Tyr Lys lie Val'Gly Glu Thr Arg Asp 
145 150 155 160 

Asp Ala Val Gly Glu Ala Tyr Asp Lys Val Gly Arg Val Met Gly Leu 
165 170 175 

Thr Tyr Pro Ala Gly Arg Glu He Asp Glu Leu Ala His Gin Gly Gin 
180 185 190 

Asp He Tyr Asp Phe Pro Arg Ala Met He Lys Glu Asp Asn Leu Glu 
195 200 205 

Phe Ser Phe Ser Gly Leu Lys Ser Ala Phe He Asn Leu His His Asn 
210 215 220 

Ala Glu Gin Lys Gly Glu Ser Leu Ser Thr Glu Asp Leu Cys Ala Ser 
225 230 235 240 

Phe Gin Ala Ala Val Met Asp He Leu Met Ala Lys Thr Lys Lys Ala 
245 250 255 

Leu Glu Lys Tyr Pro Val Lys He Leu Val Val Ala Gly Gly Val Ala 
260 265 270 

Ala Asn Lys Gly Leu Arg Glu Arg Leu Ala Ala Glu He Thr Asp Val 
275 280 285 

Lys Val He He Pro Pro Leu Arg Leu Cys Gly Asp Asn Ala Gly Met 
290 295 300 

He Ala Tyr Ala Ser Val Ser Xaa Trp Asn Lys Glu Asn Phe Ala Gly 
305 310 315 320 

Trp Asp Leu Asn Ala Lys Pro Ser Leu Ala Phe Asp Thr Met Glu 
325 330 335 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1000 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

GGTAGTTAAA GTTGGTATTA ACGGTTTCGG ACGTATCGGT CGTCTTGCTT TCCGTCGTAT 60 

CCAAAACGTA GAAGGTGTTG AAGTTACACG CATCAACGAC CTTACAGATC CAGTTATGCT 120 

TGCACACTTG TTGAAATACG ACACAACTCA AGGTCGTTTC GACGGTACTG TTGAAGTTAA 180 

AGAAGGTGGA TTTGAAGTTA ACGGTAAATT CATCAAAGTT TCTGCTGAAC GTGATCCAGA 240 

ACAAATCGAC TGGGCTACTG ACGGTGTAGA AATCGTTCTT GAAGCTACTG GTTTCTTTGC 300 

TAAGAAAGAA GCAGCTGAAA AACACCTTAA AGGTGGAGCT AAAAAAGTTG TTATCACTGC 360 

TCCTGGTGGA AACGACGTTA AAACAGTTGT ATTCAACACT AACCACGACG TTCTTGACGG 420 
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TACTGAAACA GTTATCTCAG GTGCTTCATG TACTACAAAC TGCTTGGCTC CAATGGCTAA 480 

AGCTCTTCAA GACAACTTTG GTGTTGTTGA AGGATTGATG ACTACTATCC ACGCTTACAC 540 

TGGTGACCAA ATGATCCTTG ACGGACCACA CCGTGGTGGT GACCTTCGCC GTGCTCGCGC 600 

TGGTGCTGCA AACATCGTTC CTAACTCAAC TGGTGCTGCA AAAGCTATCG GTCTTGTAAT 660 

CCCAGAATTG AATGGTAAAC TTGACGGATC TGCACAACGC GTTCCAACTC CAACTGGATC 720 

AGTTACTGAA TTGGTAGCAG TTCTTGAAAA GAACGTTACT GTTGATGAAG TGAACGCAGC 780 

TATGAAAGCA GCTTCAAACG AATCATACGG TTACACAGAA GATCCAATCG TATCTTCAGA 840 

TATCGTAGGT ATGTCTTACG GTTCATTGTT TGACGCAACT CAAACTAAAG TTCTTGACGT 900 

TGACGGTAAA CAATTGGTTA AAGTTGTATC ATGGTACGAC AACGAAATGT CATACACTGC 960 

ACAACTTGTT CGTACTCTTG GAATACTTCG CAAAAATTGC 1000 
(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 333 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Val Val Lys Val Gly He Asn Gly Phe Gly Arg He Gly Arg Leu Ala 
15 10 15 

Phe Arg Arg He Gin Asn Val Glu Gly Val Glu Val Thr Arg He Asn 
20 25 30 

Asp Leu Thr Asp Pro Val Met Leu Ala His Leu Leu Lys Tyr Asp Thr 
35 40 45 

Thr Gin Gly Arg Phe Asp Gly Thr Val Glu Val Lys Glu Gly Gly Phe 
50 55 60 

Glu Val Asn Gly Lys Phe He Lys Val Ser Ala Glu Arg Asp Pro Glu 
65 70 75 80 

Gin He Asp Trp Ala Thr Asp Gly Val Glu He Val Leu Glu Ala Thr 
85 90 95 

Gly Phe Phe Ala Lys Lys Glu Ala Ala Glu Lys His Leu Lys Gly Gly 
100 105 110 

Ala Lys Lys Val Val He Thr Ala Pro Gly Gly Asn Asp Val Lys Thr 
115 120 125 

Val Val Phe Asn Thr Asn His Asp Val Leu Asp Gly Thr Glu Thr Val 
130 135 140 

He Ser Gly Ala Ser Cys Thr Thr Asn Cys Leu Ala Pro Met Ala Lys 
145 150 155 160 
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Ala Leu Gin Asp Asn Phe Gly Val Val Glu Gly Leu Met Thr Thr lie 
165 170 175 

His Ala Tyr Thr Gly Asp Gin Met lie Leu Asp Gly Pro His Arg Gly 
180 185 190 

Gly Asp Leu Arg Arg Ala Arg Ala Gly Ala Ala Asn lie Val Pro Asn 
195 200 205 

Ser Thr Gly Ala Ala Lys Ala lie Gly Leu Val lie Pro Glu Leu Asn 
210 215 220 

Gly Lys Leu Asp Gly Ser Ala Gin Arg Val Pro Thr Pro Thr Gly Ser 
225 230 235 240 

Val Thr Glu Leu Val Ala Val Leu Glu Lys Asn Val Thr Val Asp Glu 
245 250 255 

Val Asn Ala Ala Met Lys Ala Ala Ser Asn Glu Ser Tyr Gly Tyr Thr 
260 265 270 

Glu Asp Pro He Val Ser Ser Asp He Val Gly Met Ser Tyr Gly Ser 
275 280 285 

Leu Phe Asp Ala Thr Gin Thr Lys Val Leu Asp Val Asp Gly Lys Gin 
290 295 300 

Leu Val Lys Val Val Ser Trp Tyr Asp Asn Glu Met Ser Tyr Thr Ala 
305 310 315 320 

Gin Leu Val Arg Thr Leu Gly He Leu Arg Lys Asn Cys 
325 330 

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2389 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

TTCTTACGAG TTGGGACTGT ATCAAGCTAG AACGGTTAAG GAAAATAATC GTGTTTCCTA 60 
TATAGATGGA AAACAAGCGA CGCAAAAAAC GGAGAATTTG ACTCCTGATG AGGTTAGCAA . 120 

GCGTGAAGGA ATCAATGCTG AGCAAATCGT CATCAAGATA ACAGACCAAG GCTATGTCAC 180 

TTCACATGGC GACCACTATC ATTATTACAA TGGTAAGGTT CCTTATGACG CTATCATCAG 240 

TGAAGAATTA CTCATGAAAG ATCCAAACTA TAAGCTAAAA GATGAGGATA TTGTTAATGA 300 

GGTCAAGGGT GGATATGTTA TCAAGGTAGA TGGAAAATAC TATGTTTACC TTAAGGATGC 360 

TGCCCACGCG GATAACGTCC GTACAAAAGA GGAAATCAAT CGACAAAAAC AAGAGCATAG 420 

TCAACATCGT GAAGGTGGAA CTCCAAGAAA CGATGGTGCT GTTGCCTTGG CACGTTCGCA 480 

AGGACGCTAT ACTACAGATG ATGGTTATAT CTTTAATGCT TCTGATATCA TAGAGGATAC 540 

TGGTGATGCT TATATCGTTC CTCATGGAGA TCATTACCAT TACATTCCTA AGAATGAGTT 600 

ATCAGCTAGC GAGTTGGCTG CTGCAGAAGC CTTCCTATCT GGTCGAGGAA ATCTGTCAAA 660 
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TTCAAGAACC 


TATCGCCGAC 


AAAATAGCGA 


TAACACTTCA 


AGAACAAACT 


GGGTACCTTC 


720 


TGTAAGCAAT 


CCAGGAACTA 


CAAATACTAA 


CACAAGCAAC 


AACAGCAACA 


CTAACAGTCA 


780 


AGCAAGTCAA 


AGTAATGACA 


TTGATAGTCT 


CTTGAAACAG 


CTCTACAAAC 


TGCCTTTGAG 


840 


TCAACGACAT 


GTAGAATCTG 


ATGGCCTTGT 


CTTTGATCCA 


GCACAAATCA 


CAAGTCGAAC 


900 


AGCTAGAGGT 


GTTGCAGTGC 


CACACGGAGA 


TCATTACCAC 


TTCATCCCTT 


ACTCTCAAAT 


960 


GTCTGAATTG 


GAAGAACGAA 


TCGCTCGTAT 


TATTCCCCTT 


CGTTATCGTT 


CAAACCATTG 


1020 


GGTACCAGAT 


TCAAGGCCAG 


AACAACCAAG 


TCCACAACCG 


ACTCCGGAAC 


CTAGTCCAGG 


1080 


CCCGCAACCT 


GCACCAAATC 


TTAAAATAGA 


CTCAAATTCT 


TCTTTGGTTA 


GTCAGCTGGT 


1140 


ACGAAAAGTT 


GGGGAAGGAT 


ATGTATTCGA 


AGAAAAGGGC 


ATCTCTCGTT 


ATGTCTTTGC 


1200 


GAAAGATTTA 


CCATCTGAAA 


CTGTTAAAAA 


TCTTGAAAGC 


AAGTTATCAA 


AACAAGAGAG 


1260 


TGTTTCACAC 


ACTTTAACTG 


CTAAAAAAGA 


AAATGTTGCT 


CCTCGTGACC 


AAGAATTTTA 


1320 


TGATAAAGCA 


TATAATCTGT 


TAACTGAGGC 


TCATAAAGCC 


TTGTTTGNAA 


ATAAGGGTCG 


1380 


TAATTCTGAT 


TTCCAAGCCT 


TAGACAAATT 


ATTAGAACGC 


TTGAATGATG 


AATCGACTAA 


1440 


TAAAGAAAAA 


TTGGTAGATG 


ATTTATTGGC 


ATTCCTAGCA 


CCAATTACCC 


ATCCAGAGCG 


1500 


ACTTGGCAAA 


CCAAATTCTC 


AAATTGAGTA 


TACTGAAGAC 


GAAGTTCGTA 


TTGCTCAATT 


1560 


AGCTGATAAG 


TATACAACGT 


CAGATGGTTA 


CATTTTTGAT 


GAACATGATA 


TAATCAGTGA 


1620 


TGAAGGAGAT 


GCATATGTAA 


CGCCTCATAT 


GGGCCATAGT 


CACTGGATTG 


GAAAAGATAG 


1680 


CCTTTCTGAT 


AAGGAAAAAG 


TTGCAGCTCA 


AGCCTATACT 


AAAGAAAAAG 


GTATCCTACC 


1740 


TCCATCTCCA 


GACGCAGATG 


TTAAAGCAAA 


TCCAACTGGA 


GATAGTGCAG 


CAGCTATTTA 


1800 


CAATCGTGTG 


AAAGGGGAAA 


AACGAATTCC 


ACTCGTTCGA 


CTTCCATATA 


TGGTTGAGCA 


1860 


TACAGTTGAG 


GTTAAAAACG 


GTAATTTGAT 


TATTCCTCAT 


AAGGATCATT 


ACCATAATAT 


1920 


TAAATTTGCT 


TGGTTTGATG 


ATCACACATA 


CAAAGCTCCA 


AATGGCTATA 


CCTTGGAAGA 


1980 


TTTGTTTGCG 


ACGATTAAGT 


ACTACGTAGA 


ACACCCTGAC 


GAACGTCCAC 


ATTCTAATGA 


2040 


TGGATGGGGC 


AATGCCAGTG 


AGCATGTGTT 


AGGCAAGAAA 


GACCACAGTG 


AAGATCCAAA 


2100 


TAAGAACTTC 


AAAGCGGATG 


AAGAGCCAGT 


AGAGGAAACA 


CCTGCTGAGC 


CAGAAGTCCC 


2160 


TCAAGTAGAG 


ACTGAAAAAG 


TAGAAGCCCA 


ACTCAAAGAA 


GCAGAAGTTT 


TGCTTGCGAA 


2220 


AGTAACGGAT 


TCTAGTCTGA 


AAGCCAATGC 


AACAGAAACT 


CTAGCTGGTT 


TACGAAATAA 


2280 


TTTGACTCTT 


CAAATTATGG 


ATAACAATAG 


TATCATGGCA 


GAAGCAGAAA 


AATTACTTGC 


2340 


GTTGTTAAAA 


GGAAGTAATC 


CTTCATCTGT 


AAGTAAGGAA 


AAAATAAAC 




2389 



(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 796 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

Ser Tyr Glu Leu Gly Leu Tyr Gin Ala Arg Thr Val Lys Glu Asn Asn 
15 10 15 

Arg Val Ser Tyr lie Asp Gly Lys Gin Ala Thr Gin Lys Thr Glu Asn 
20 25 30 

Leu Thr Pro Asp Glu Val Ser Lys Arg Glu Gly lie Asn Ala Glu Gin 
35 40 " 45 

lie Val lie Lys lie Thr Asp Gin Gly Tyr Val Thr Ser His Gly Asp 
50 55 60 

His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr Asp Ala lie lie Ser 
65 70 75 80 

Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Lys Leu Lys Asp Glu Asp 
85 90 95 

lie Val Asn Glu Val Lys Gly Gly Tyr Val lie Lys Val Asp Gly Lys 
100 105 110 

Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala Asp Asn Val Arg Thr 
115 120 125 

Lys Glu Glu lie Asn Arg Gin Lys Gin Glu His Ser Gin His Arg Glu 
130 135 140 

Gly Gly Thr Pro Arg Asn Asp Gly Ala Val Ala Leu Ala Arg Ser Gin 
145 150 155 160 

Gly Arg Tyr Thr Thr Asp Asp Gly Tyr lie Phe Asn Ala Ser Asp lie 
165 170 175 

lie Glu Asp Thr Gly Asp Ala Tyr lie Val Pro His Gly Asp His Tyr 
180 185 190 

His Tyr lie Pro Lys Asn Glu Leu Ser Ala Ser Glu Leu Ala Ala Ala 
195 200 205 

Glu Ala Phe Leu Ser Gly Arg Gly Asn Leu Ser Asn Ser Arg Thr Tyr 
210 215 220 

Arg Arg Gin Asn Ser Asp Asn Thr Ser Arg Thr Asn Trp Val Pro Ser 
225 230 235 240 

Val Ser Asn Pro Gly Thr Thr Asn Thr Asn Thr Ser Asn Asn Ser Asn 
245 250 255 

Thr Asn Ser Gin Ala Ser Gin Ser Asn Asp lie Asp Ser Leu Leu Lys 
260 265 270 

Gin Leu Tyr Lys Leu Pro Leu Ser Gin Arg His Val Glu Ser Asp Gly 
275 280 285 

Leu Val Phe Asp Pro Ala Gin lie Thr Ser Arg Thr Ala Arg Gly Val 
290 295 300 
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Ala Val Pro His Gly Asp His Tyr His Phe lie Pro Tyr Ser Gin Met 
305 310 315 320 

Ser Glu Leu Glu Glu Arg lie Ala Arg lie lie Pro Leu Arg Tyr Arg 
325 330 335 

Ser Asn His Trp Val Pro Asp Ser Arg Pro Glu Gin Pro Ser Pro Gin 
340 345 350 

Pro Thr Pro Glu Pro Ser Pro Gly Pro Gin Pro Ala Pro Asn Leu Lys 
355 360 365 

lie Asp Ser Asn Ser Ser Leu Val Ser Gin Leu Val Arg Lys Val Gly 
370 375 380 

Glu Gly Tyr Val Phe Glu Glu Lys Gly lie Ser Arg Tyr Val Phe Ala 
385 390 395 400 

Lys Asp Leu Pro Ser Glu Thr Val Lys Asn Leu Glu Ser Lys Leu Ser 
405 410 415 

Lys Gin Glu Ser Val Ser His Thr Leu Thr Ala Lys Lys Glu Asn Val 
420 425 430 

Ala Pro Arg Asp Gin Glu Phe Tyr Asp Lys Ala Tyr Asn Leu Leu Thr 
435 440 445 

Glu Ala His Lys Ala Leu Phe Xaa Asn Lys Gly Arg Asn Ser Asp Phe 
450 455 460 

Gin Ala Leu Asp Lys Leu Leu Glu Arg Leu Asn Asp Glu Ser Thr Asn 
465 470 475 480 

Lys Glu Lys Leu Val Asp Asp Leu Leu Ala Phe Leu Ala Pro lie Thr 
485 490 495 

His Pro Glu Arg Leu Gly Lys Pro Asn Ser Gin lie Glu Tyr Thr Glu 
500 505 510 

Asp Glu Val Arg lie Ala Gin Leu Ala Asp Lys Tyr Thr Thr Ser Asp 
515 520 525 

Gly Tyr lie Phe Asp Glu His Asp lie lie Ser Asp Glu Gly Asp Ala 
530 535 540 

Tyr Val Thr Pro His Met Gly His Ser His Trp lie Gly Lys Asp Ser 
545 550 555 560 

Leu Ser Asp Lys Glu Lys Val Ala Ala Gin Ala Tyr Thr Lys Glu Lys 
565 570 575 

Gly lie Leu Pro Pro Ser Pro Asp Ala Asp Val Lys Ala Asn Pro Thr 
580 585 590 

Gly Asp Ser Ala Ala Ala lie Tyr Asn Arg Val Lys Gly Glu Lys Arg 
595 600 605 

lie Pro Leu Val Arg Leu Pro Tyr Met Val Glu His Thr Val Glu Val 
610 615 620 

Lys Asn Gly Asn Leu lie lie Pro His Lys Asp His Tyr His Asn lie 
625 630 635 640 
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Lys Phe Ala Trp Phe Asp Asp His Thr Tyr Lys Ala Pro Asn Gly Tyr 
645 650 655 

Thr Leu Glu Asp Leu Phe Ala Thr lie Lys Tyr Tyr Val Glu His Pro 
660 665 670 

Asp Glu Arg Pro His Ser Asn Asp Gly Trp Gly Asn Ala Ser Glu His 
675 680 685 

Val Leu Gly Lys Lys Asp His Ser Glu Asp Pro Asn Lys Asn Phe Lys 
690 695 700 

Ala Asp Glu Glu Pro Val Glu Glu Thr Pro Ala Glu Pro Glu Val Pro 
705 710 715 720 

Gin Val Glu Thr Glu Lys Val Glu Ala Gin Leu Lys Glu Ala Glu Val 
725 730 735 

Leu Leu Ala Lys Val Thr Asp Ser Ser Leu Lys Ala Asn Ala Thr Glu 
740 745 750 

Thr Leu Ala Gly Leu Arg Asn Asn Leu Thr Leu Gin lie Met Asp Asn 
755 760 765 

Asn Ser lie Met Ala Glu Ala Glu Lys Leu Leu Ala Leu Leu Lys Gly 
770 775 780 

Ser Asn Pro Ser Ser Val Ser Lys Glu Lys lie Asn 
785 790 795 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1180 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

TACTGAGATG CATCATAATC TAGGAGCTGA AAAGCGTTCA GCAGTGGCTA CTACTATCGA 60 

TAGTTTTAAG GAGCGAAGTC AAAAAGTCAG AGCACTATCT GATCCAAATG TGCGTTTTGT 120 

TCCCTTCTTT GGCTCTAGTG AATGGCTTCG TTTTGACGGT GCTCATTCTG CGGTATTAGC 180 

TGAGAAATAC AATCGTTCCT ACCGTCCTTA TCTTTTAGGA CAGGGGGGAG CTGCATCGCT 240 

TAACCAATAT TTTGGAATGC AACAGATGTT ACCACAGCTG GAGAATAAAC AAGTTGTGTA 3 00 

TGTTATCTCA CCTCAGTGGT TCAGTAAAAA TGGCTATGAT CCAGCAGCCT TCCAGCAGTA 360 

TTTTAATGGA GACCAGTTGA CTAGTTTTCT GAAACATCAA TCTGGGGATC AGGCTAGTCA 420 

ATATGCAGCG ACTCGCTTAC TGCAACAGTT CCCAAACGTA GCTATGAAGG ACCTGGTTCA 480 

GAAGTTGGCA AGTAAAGAAG AATTGTCGAC AGCAGACAAT GAAATGATTG AATTATTGGC 540 

TCGTTTTAAT GAACGCCAAG CTTCCTTTTT TGGTCAGTTT TCGGTTAGAG GCTATGTTAA 600 

CTACGATAAG CATGTAGCTA AGTATTTAAA AATCTTGCCA GACCAGTTTT CTTATCAGGC 660 
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AATAGAAGAT GTTGTCAAAG CAGATGCTGA AAAAAATACT TCCAATAATG AGATGGGAAT 720 

GGAAAATTAT TTCTATAATG AGCAGATCAA GAAGGATTTG AAGAAATTAA AGGATTCTCA 780 

GAAAAGCTTT ACCTATCTCA AGTCGCCAGA GTATAATGNN TTGCAGTTGG TTTTAACACA 840 

GTTTTCTAAA TCTAAGGTAA ACCCGATTTT TATCATTCCA CCTGTTAATA AAAAATGGAT 900 

GNACTATGCT GGTCTACGAG AGGATATGTA CCAACAAACG GTGCAGAAGA TTCGCTACCA 960 

GTTAGAAAGT CAAGGTTTTA CCAATATAGC AGATTTTTCT AAGGACGGCG GGGAGCCTTT 1020 

CTTTATGAAG GACACCATTC ACCTTGGTTG GTTGGGTTGG TTGGCTTTTG ACAAGGCAGT 1080 

TGATCCTTTC CTATCCAATC CCACACCAGC TCCGACTTAC CATCTGAATG AGCGCTTTTT 1140 

CAGCAAAGAT TGGGCGACTT ATGATGGAGA TGTCAAAGAA 1180 
(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 393 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

Thr Glu Met His His Asn Leu Gly Ala Glu Lys Arg Ser Ala Val Ala 
15 10 15 

Thr Thr lie Asp Ser Phe Lys Glu Arg Ser Gin Lys Val Arg Ala Leu 
20 25 30 

Ser Asp Pro Asn Val Arg Phe Val Pro Phe Phe Gly Ser Ser Glu Trp 
35 40 45 

Leu Arg Phe Asp Gly Ala His Ser Ala Val Leu Ala Glu Lys Tyr Asn 
50 55 60 

Arg Ser Tyr Arg Pro Tyr Leu Leu Gly Gin Gly Gly Ala Ala Ser Leu 
65 70 75 80 

Asn Gin Tyr Phe Gly Met Gin Gin Met Leu Pro Gin Leu Glu Asn Lys 
85 90 95 

Gin Val Val Tyr Val lie Ser Pro Gin Trp Phe Ser Lys Asn Gly Tyr 
100 105 110 

' Asp Pro Ala Ala Phe Gin Gin Tyr Phe Asn Gly Asp Gin Leu Thr Ser 
115 120 125 

Phe Leu Lys His Gin Ser Gly Asp Gin Ala Ser Gin Tyr Ala Ala Thr 
130 135 140 

Arg Leu Leu Gin Gin Phe Pro Asn Val Ala Met Lys Asp Leu Val Gin 
145 150 155 160 

Lys Leu Ala Ser Lys Glu Glu Leu Ser Thr Ala Asp Asn Glu Met lie 
165 170 175 
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Glu Leu Leu Ala Arg Phe Asn GluArg Gin Ala Ser Phe Phe Gly Gin 
180 185 190 

Phe Ser Val Arg Gly Tyr Val Asn Tyr Asp Lys His Val Ala Lys Tyr 
195 200 205 

Leu Lys lie Leu Pro Asp Gin Phe Ser Tyr Gin Ala lie Glu Asp Val 
210 215 220 

Val Lys Ala Asp Ala Glu Lys Asn Thr Ser Asn Asn Glu Met Gly Met 
225 230 235 240 

Glu Asn Tyr Phe Tyr Asn Glu Gin lie Lys Lys Asp Leu Lys Lys Leu 
245 250 ' 255 

Lys Asp Ser Gin Lys Ser Phe Thr Tyr Leu Lys Ser Pro Glu Tyr Asn 
260 265 270 

Xaa Leu Gin Leu Val Leu Thr Gin Phe Ser Lys Ser Lys Val Asn Pro 
275 280 285 

lie Phe lie lie Pro Pro Val Asn Lys Lys Trp Met Xaa Tyr Ala Gly 
290 295 300 

Leu Arg Glu Asp Met Tyr Gin Gin Thr Val Gin Lys lie Arg Tyr Gin 
305 310 315 320 

Leu Glu Ser Gin Gly Phe Thr Asn lie Ala Asp Phe Ser Lys Asp Gly 
325 330 335 

Gly Glu Pro Phe Phe Met Lys Asp Thr lie His Leu Gly Trp Leu Gly 
340 345 350 

Trp Leu Ala Phe Asp Lys Ala Val Asp Pro Phe Leu Ser Asn Pro Thr 
355 360 365 

Pro Ala Pro Thr Tyr His Leu Asn Glu Arg Phe Phe Ser Lys Asp Trp 
370 375 380 

Ala Thr Tyr Asp Gly Asp Val Lys Glu 
385 390 

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1423 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi.) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

GGTTTTGAGA AAGTATTTGC AGGGGGCCCT GATTGAGTCG ATTGAGCAAG TGGAAAATGA 60 

CCGTATTGTG GAAATTACAG TTTCCAATAA AAACGAGATT GGAGACCATA TCCAGGCTAC 120 

CTTGATTATC GAAATTATGG GGAAACACAG TAATATTCTA CTGGTCGATA AAAGCAGTCA 180 

TAAAATCCTC GAAGTTATCA AACACGTCGG CTTTTCACAA AATAGCTACC GCACCTTACT 240 

TCCAGGATCG ACCTATATCG CTCCGCCAAG TACAAAATCT CTCAATCCTT TTACTATCAA 300 
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GGATGAAAAG 


C TC TTTG AAA 




AALjAAL. i A 


Ar^A/^/^AAAAA 

AL. ALrL. AAAAA 


Amr^mmr^A a a/^ 
A i L i i L AAAL 


o D u 


CCTCTTTCAA 


GGTCTGGGAC 


GL G A 1 AL ObL 


AAA i b AA I IKj 


A A A r^r^ A m A 
(jAAAvjVjA i AL. 


m/^omm A r^rr^r^ a 
i LL 1 i AL i LA 




AAAAC TTTC C 


GCTTTCCGAA 


AI 1 i i i i CAA 


1 L AAUAAAL- U 


AALtL-^^A i tjL. i 


1 LAL 1 LALAL 


A p n 

4t o U 


TTCCTTCAGT 


CC AGTTCC TT 


rnrp/^/^ TV tv a A 

i IGLAAAiL-A 


<^/^'T>/^r'/^ A O A 

tjVj J. (jLxLiAtaAbr 


OOfTimrpmoo tv a 


A mommmr^ m^ a 
AiLl i ILi LA 




TTTGTTGGAC 


AC C T AC TATA 


AGGATAAGGC 


i GAGCGCGAL 


oooomr^ a a a 
HjCvjiCAAAL 


A OP* A r^/^oo A 
ALL ALLL LAL 


D U U 


TGAACTGATT 


CGTCGTGTTG 


TV H TV TV TV TV m 

AAAATGAACT 


mOAr^A A A A AO 

TLAGAAAAAL 


OOAOAOA A A<^ 

L GAL AL AAAL 


mo AAAAAAOA 

iLAAAAAALA 


boU 


GGAAAAAGAG 


mm "jv m /** TV 

TTACTGGCGA 


-ji TV TV TV r>r^/~* 

C AG AC AAC GC 


mo A A o A A mmm 

TGAAGAA i 1 1 


OOmO A A A A A O 

LG i LAAAAAG 


OAOA Ammoom 
LALAA 1 i LL i 




GACAACCTTC 


CTCCACCAAG 


TGCCTAACGA 


OO "K TV O TV OO TV O 

C L AAG AC LAG 


o mm A moo m a o 

Gi 1 AILL iAG 


A o A A om A om A 

AL AAL 1 AL i A 


"7 Q r» 
/ o U 


TACCAACCAA 


^^y^ m TV Tv m/^ TV 

CCTATCATGA 


TTGCGCTTGA 


mA A ooomomo 

TAAGGC TL TG 


AOmOOOA AOO 

AL i LLLAALL 


A o A A n^/^r^/~*r^ a 
ALAA i LL L L A 


Q /I n 


ACGCTATTTT 


AAACGGTATC 


Tl TV TV TV /^m^^ TV TV 

AG AAAC TC AA 


TV /~i TV TV oomomo 

AG AAGC TGTC 


TV TV TV m TV omm/^ a 

AAATAL i 1 GA 


L i LA ill L A i 


Q n n 


TGAAGAAACC 


TV TV TV /^y^ TV m TV 

AAAGCCACTA 


TTCTCTATCT 


OOA A AOmomA 

GGAAAGTGTA 


OA A 7\ O O O rpO O 

GAAALLGTLL 


mo A AOOA AOO 

1 LAALLAALL 


y bu 


TGGACTGGAA 


TV TV TV m^^/^^^m^^ 

GAAATCGCTG 


TV TV TV m/~*f^/~trT\/~t TV 

AAATCCGTGA 


A o A A mmo A mm 

AGAATTGATT 


OAAAOAOO mm 
LAAALAGGi i 


mm A TT^c^r^r^r* a o 
i iAiLLLLAL 


1 n o n 


AAGACAACGG 


/^TV TV TV TV 71 m/^/^ 

GAG AAAATC C 


TV/^TV TV 7\ r^/~^ 7\ TV 

AGAAACGCAA 


AA A AOmAOA A 

AAAAL T AG AA 


o A A m A mom a o 
LAAi AIL lAL 


O A AOO/^AmOO 

L AALLLA i LL 


1 n Q n 
XUo U 


C AAAAC CATC 


ATCTATGTCG 


GAL G AAAL AA 


mommo a a a a m 
iC 1 iCAAAAJ. 


o A oo A A mm/^ A 
GALLAAi itjA 


o o mmm a a a a m 
LL 1 1 i AAAAi 




GGCCCGCAAG 


TV ^*t/^Tv TV ^^mmm 

GAGGAAC TTT 


GGTTCCATGC 


m A A oo A o A mm 

TAAGGALAl 1 


OOmOOA AOOO 

LL i GGAAGLL 


A rnonTTiorpo a m* 
AiG-L iLlLAi 


1 o n n 


CTCAGGAAAT 


CTTGACCCAT 


CTGATGCAGT 


CAAGACAGAC 


GCAGCAGAGT 


TAGCTGCCTA 


1260 


CTTCTCTCAA 


GGGCGCCTGT 


CGAATCTGGT 


GCAGGTAGAT 


ATGATTGAAG 


TCAAAAAACT 


1320 


CAATAAACCA 


ACTGGTGGAA 


AACCCGGCTT 


TGTCACTTAC 


ACAGGACAAA 


AGACCCTCCG 


1380 


CGTCACACCA 


GACTCCAAAA 


AAATTGCATC 


CATGAAAAAA 


TCC 




1423 



(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 474 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

Val Leu Arg Lys Tyr Leu Gin Gly Ala Leu lie Glu Ser lie Glu Gin 
15 10 15 

Val Glu Asn Asp Arg lie Val Glu He Thr Val Ser Asn Lys Asn Glu 
20 25 30 

He Gly Asp His He Gin Ala Thr Leu He He Glu He Met Gly Lys 
35 40 45 

His Ser Asn He Leu Leu Val Asp Lys Ser Ser His Lys He Leu Glu 
50 55 60 

Val He Lys His Val Gly Phe Ser Gin Asn Ser Tyr Arg Thr Leu Leu 
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65 70 75 80 

Pro Gly Ser Thr Tyr lie Ala Pro Pro Ser Thr Lys Ser Leu Asn Pro 
85 90 95 

Phe Thr lie Lys Asp Glu Lys Leu Phe Glu lie Leu Gin Thr Gin Glu 
100 105 110 

Leu Thr Ala Lys Asn Leu Gin Ser Leu Phe Gin Gly Leu Gly Arg Asp 
115 120 125 

Thr Ala Asn Glu Leu Glu Arg lie Leu Val Ser Glu Lys Leu Ser Ala 
130 135 140 

Phe Arg Asn Phe Phe Asn Gin Glu Thr Lys Pro Cys Leu Thr Glu Thr 
145 150 155 160 

Ser Phe Ser Pro Val Pro Phe Ala Asn Gin Val Gly Glu Pro Phe Ala 
165 170 175 

Asn Leu Ser Asp Leu Leu Asp Thr Tyr Tyr Lys Asp Lys Ala Glu Arg 
180 185 190 

Asp Arg Val Lys Gin Gin Ala Ser Glu Leu lie Arg Arg Val Glu Asn 
195 200 205 

Glu Leu Gin Lys Asn Arg His Lys Leu Lys Lys Gin Glu Lys Glu Leu 
210 215 220 

Leu Ala Thr Asp Asn Ala Glu Glu Phe Arg Gin Lys Gly Glu Leu Leu 
225 230 235 240 

Thr Thr Phe Leu His Gin Val Pro Asn Asp Gin Asp Gin Val lie Leu 
245 250 255 

Asp Asn Tyr Tyr Thr Asn Gin Pro lie Met lie Ala Leu Asp Lys Ala 
260 265 270 

Leu Thr Pro Asn Gin Asn Ala Gin Arg Tyr Phe Lys Arg Tyr Gin Lys 
275 280 285 

Leu Lys Glu Ala Val Lys Tyr Leu Thr Asp Leu lie Glu Glu Thr Lys 
290 295 300 

Ala Thr lie Leu Tyr Leu Glu Ser Val Glu Thr Val Leu Asn Gin Ala 
305 310 315 320 

Gly Leu Glu Glu He Ala Glu He Arg Glu Glu Leu He Gin Thr Gly 
325 330 335 

Phe He Arg Arg Arg Gin Arg Glu Lys He Gin Lys Arg Lys Lys Leu 
340 345 350 

Glu Gin Tyr Leu Ala Ser Asp Gly Lys Thr He He Tyr Val Gly Arg 
355 360 365 

Asn Asn Leu Gin Asn Glu Glu Leu Thr Phe Lys Met Ala Arg Lys Glu 
370 375 380 

Glu Leu Trp Phe His Ala Lys Asp He Pro Gly Ser His Val Val He 
385 390 395 400 

Ser Gly Asn Leu Asp Pro Ser Asp Ala Val Lys Thr Asp Ala Ala Glu 
405 410 415 
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Leu Ala Ala Tyr Phe Ser Gin Gly 
420 

Asp Met lie Glu Val Lys Lys Leu 
435 440 

Gly Phe Val Thr Tyr Thr Gly Gin 
450 455 

Ser Lys Lys lie Ala Ser Met Lys 
465 470 

(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 544 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



Arg Leu Ser Asn Leu Val Gin Val 
425 430 

Asn Lys Pro Thr Gly Gly Lys Pro 
445 ■ 

Lys Thr Leu Arg Val Thr Pro Asp 
460 

Lys Ser 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

GACAACATTT ACTATCCATA CAGTAGAGTC AGCACCAGCA GAAGTGAAAG AAATTCTTGA 60 

AACAGTAGAA AAAGACAACA ATGGCTATAT TCCCAACCTA ATCGGTCTCT TGGCCAATGC 120 

CCCGACTGTT TTAGAAGCCT ACCAAATTGT CTCATCTATC CACCGTCGCA ACAGCCTGAC 180 

ACCCGTTGAG CGTGAAGTGG TGCAAATCAC GGCAGCCGTG ACCAATGGTT GTGCCTTCTG 240 

TGTCGCAGGT CACACAGCCT TTTCCATCAA ACAAATCCAG ATGAATGATG ACTTGATTCA 300 

AGCTCTTCGC AATCGTACTC CAATTGAAAC AGATCCTAAA TTGGATACCC TAGCTAAGTT 360 

TACCTTGGCA GTTATCAATA CCAAGGGTCG TGTAGGAGAT GAAGCCTTGT CTGAGTTTTT 420 

AGAAGCTGGC TACACTCAAC AAAATGCCTT GGATGTGGTT TTTGGTGTCA GCCTAGCAAT 480 

CCTCTGTAAC TATGCCAACA ACTTAGCTAA TACACCAATT AATCCAGAAT TGCAACCTTA 540 

TGCC 544 
(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 181 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

Thr Thr Phe Thr lie His Thr Val Glu Ser Ala Pro Ala Glu Val Lys 
1 5 10 . 15 

Glu lie Leu Glu Thr Val Glu Lys Asp Asn Asn Gly Tyr lie Pro Asn 
20 25 30 
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Leu He Gly Leu Leu Ala Asn Ala Pro Thr Val Leu Glu Ala Tyr Gin 
35 40 45 

He Val Ser Ser He His Arg Arg Asn Ser Leu Thr Pro Val Glu Arg 
50 ' 55 60 

Glu Val Val Gin He Thr Ala Ala Val Thr Asn Gly Cys Ala Phe Cys 
65 70 75 80 

Val Ala Gly His Thr Ala Phe Ser He Lys Gin He Gin Met Asn Asp 
85 90 95 

Asp Leu He Gin Ala Leu Arg Asn Arg Thr Pro He Glu Thr Asp Pro 
100 105 110 

Lys Leu Asp Thr Leu Ala Lys Phe Thr Leu Ala Val He Asn Thr Lys 
115 120 125 

Gly Arg Val Gly Asp Glu Ala Leu Ser Glu Phe Leu Glu Ala Gly Tyr 
130 135 140 

Thr Gin Gin Asn Ala Leu Asp Val Val Phe Gly Val Ser Leu Ala He 
145 150 155 160 

Leu Cys Asn Tyr Ala Asn Asn Leu Ala Asn Thr Pro He Asn Pro Glu 
165 170 175 

Leu Gin Pro Tyr Ala 
180 

(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 811 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

GGCTAAGGAA AGAGTGGATG TACTAGCTTA TAAACAGGGG TTGTTTGAAA CGAGAGAGCA 60 

GGCCAAGCGA GGTGTGATGG CTGGCCTAGT CGTAGCAGTC CTTAATGGAG AACGGTTTGA 120 

CAAGCCAGGA GAGAAAATTC CAGATGACAC CGAATTAAAA CTCAAGGGGG AGAAACTCAA 180 

GTATGTCAGC CGTGGTGGTT TGAAACTGGA AAAGGCCTTG CAGGTCTTTG ATTTGTCGGT 240 

GGATGGCGCG ACTACGATTG ATATCGGGGC CTCTACTGGA GGTTTTACCG ATGTCATGCT 300 

ACAGAATAGT GCCAAGTTGG TCTTTGCAGT CGATGTTGGT ACCAATCAGT TGGCTTGGAA 360 

ATTACGCCAA GACCCACGAG TTGTCAGCAT GGAGCAGTTC AATTTCCGCT ATGCTGAAAA 420 

GACTGATTTC GAGCAGGAGC CGAGCTTTGC CAGTATTGAT GTGAGTTTCA TTTCCCTTAG 480 

TCTGATTTTG CCAGCCTTGC ACCGTGTCTT GGCTGATCAA GGTCAGGTGG TAGCACTTGT 540 

CAAACCTCAG TTTGAGGCAG GACGTGAGCA GATTGGGAAA AATGGAATTA TTCGAGATGC 600 

TAAGGTTCAT CAGAATGTCC TTGAATCTGT AACAGCTATG GCAGTAGAGG TAGGTTTTTC 660 
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AGTCCTTGGC TTGGACTTTT CTCCCATCCA AGGTGGACAT GGAAATATTG AATTTTTAGC 



720 



GTATTTGAAA AAAGAAAAGT CAGCAAGCAA TCAGATTCTT GCTGAGATTA AAGAAGCAGT 



780 



AGAGAGGGCG CATAGTCAAT TTAAAAATGA A 



811 



(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 270 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:64: 

Ala Lys Glu Arg Val Asp Val Leu Ala Tyr Lys Gin Gly Leu Phe Glu 
15 10 15 

Thr Arg Glu Gin Ala Lys Arg Gly Val Met Ala Gly Leu Val Val Ala 
20 25 30 

Val Leu Asn Gly Glu Arg Phe Asp Lys Pro Gly Glu Lys lie Pro Asp 
35 40 45 

Asp Thr Glu Leu Lys Leu Lys Gly Glu Lys Leu Lys Tyr Val Ser Arg 
50 55 60 

Gly Gly Leu Lys Leu Glu Lys Ala Leu Gin Val Phe Asp Leu Ser Val 
65 70 75 80 

Asp Gly Ala Thr Thr lie Asp lie Gly Ala Ser Thr Gly Gly Phe Thr 
85 90 95 

Asp Val Met Leu Gin Asn Ser Ala Lys Leu Val Phe Ala Val Asp Val 
100 105 110 

Gly Thr Asn Gin Leu Ala Trp Lys Leu Arg Gin Asp Pro Arg Val Val 
115 120 125 

Ser Met Glu Gin Phe Asn Phe Arg Tyr Ala Glu Lys Thr Asp Phe Glu 
130 135 140 

Gin Glu Pro Ser Phe Ala Ser lie Asp Val Ser Phe lie Ser Leu Ser 
145 150 155 160 

Leu lie Leu Pro Ala Leu His Arg Val Leu Ala Asp Gin Gly Gin Val 
165 170 175 

Val Ala Leu Val Lys Pro Gin Phe Glu Ala Gly Arg Glu Gin lie Gly 
180 185 190 

Lys Asn Gly lie lie Arg Asp Ala Lys Val His Gin Asn Val Leu Glu 
195 200 205 

Ser Val Thr Ala Met Ala Val Glu Val Gly Phe Ser Val Leu Gly Leu 
210 215 220 

Asp Phe Ser Pro lie Gin Gly Gly His Gly Asn lie Glu Phe Leu Ala 
225 230 235 240 
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Tyr Leu Lys Lys Glu Lys Ser Ala Ser Asn Gin lie Leu Ala Glu lie 
245 250 255 

Lys Glu Ala Val Glu Arg Ala His Ser Gin Phe Lys Asn Glu 
260 265 270 

(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2290 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 



TTGTTCCTAT 


GAACTTGGTC 


GTCACCAAGC 


TGGTCAGGTT 


AAGAAAGAGT 


/**irnTv TV m^^^^ TV 

C T AATC G AGT 


60 


TTCTTATATA 


GATGGTGATC 


AGGCTGGTCA 


AAAGGCAGAA 


AACTTGACAC 


CAGATGAAGT 


120 


CAGTAAGAGG 


GAGGGGATCA 


ACGCCGAACA 


AATNGTNATC 


AAG ATT AC GG 


ATCAAGGTTA 


180 


TGTGACCTCT 


CATGGAGACC 


ATTATCATTA 


CTATAATGGC 


AAGGTTCCTT 


ATGATGCCAT 


240 


CATCAGTGAA 


GAGCTCCTCA 


TGAAAGATCC 


GAATTATCAG 


TTGAAGGATT 


CAGACATTGT 


300 


CAATGAAATC 


AAGGGTGGTT 


ATGTCATTAA 


GGTAAACGGT 


AAATACTATG 


mivTm T\ /I rr^rr*! tv tv 

TNTACCTTAA 


360 


GGATGCAGCT 


CATGCGGATA 


ATATTCGGAC 


AAAAGAAGAG 


ATTAAACGTC 


TV TV TV TV TV 

AGAAGCAGGA 


420 


ACGCAGTCAT 


AATCATAACT 


CAAGAGCAGA 


TAATGCTGTT 


GCTGCAGCCA 


GAGCCCAAGG 


480 


ACGTTATACA 


ACGGATGATG 


GGTATATCTT 


CAATGCATCT 


GATATCATTG 


TV TV TV 

AGGACACGGG 


540 


TGATGCTTAT 


ATCGTTCCTC 


ACGGCGACCA 


TTACCATTAC 


ATTCCTAAGA 


ATGAGTTATC 


600 


AGCTAGCGAG 


TTAGCTGCTG 


CAGAAGCCTA 


TTGGAATGGG 


AAGCAGGGAT 


CTCGTCCTTC 


660 


TTCAAGTTCT 


AGTTATAATG 


CAAATCCAGC 


TCAACCAAGA 


TTGTCAGAGA 


ACCACAATCT 


720 


GACTGTCACT 


CCAACTTATC 


ATCAAAATCA 


AGGGGAAAAC 


ATTTCAAGCC 


TTTTACGTGA 


780 


ATTGTATGCT 


AAACCCTTAT 


CAGAACGCCA 


TGTGGAATCT 


GATGGCCTTA 


TTTTCGACCC 


840 


AGCGCAAATC 


ACAAGTCGAA 


CCGCCAGAGG 


TGTAGCTGTC 


CCTCATGGTA 


ACCATTACCA 


900 


CTTTATCCCT 


TATGAACAAA 


TGTCTGAATT 


GGAAAAACGA 


ATTGCTCGTA 


TTATTCCCCT 


960 


TCGTTATCGT 


TCAAACCATT 


GGGTACCAGA 


TTCAAGACCA 


GAACAACCAA 


GTCCACAATC 


1020 


GACTCCGGAA 


CCTAGTCCAA 


GTCCGCAACC 


TGCACCAAAT 


CCTCAACCAG 


CTCCAAGCAA 


1080 


TCCAATTGAT 


GAGAAATTGG 


TCAAAGAAGC 


TGTTCGAAAA 


GTAGGCGATG 


GTTATGTCTT 


1140 


TGAGGAGAAT 


GGAGTTTCTC 


GTTATATCCC 


AGCCAAGGAT 


CTTTCAGCAG 


AAACAGCAGC 


1200 


AGGCATTGAT 


AGCAAACTGG 


CCAAGCAGGA 


AAGTTTATCT 


CATAAGCTAG 


GAGCTAAGAA 


1260 


AACTGACCTC 


CCATCTAGTG 


ATCGAGAATT 


TTACAATAAG 


GCTTATGACT 


TACTAGCAAG 


1320 


AATTCACCAA 


GATTTACTTG 


ATAATAAAGG 


TCGACAAGTT 


GATTTTGAGG 


CTTTGGATAA 


1380 
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CCTGTTGGAA CGACTCAAGG ATGTCNCAAG TGATAAAGTC AAGTTAGTGG ANGATATTCT 1440 

TGCCTTCTTA GCTCCGATTC GTCATCCAGA ACGTTTAGGA AAACCAAATG CGCAAATTAC 1500 

CTACACTGAT GATGAGATTC AAGTAGCCAA GTTGGCAGGC AAGTACACAA CAGAAGACGG 1560 

TTATATCTTT GATCCTCGTG ATATAACCAG TGATGAGGGG GATGCCTATG TAACTCCACA 1620 

TATGACCCAT AGCCACTGGA TTAAAAAAGA TAGTTTGTCT GAAGCTGAGA GAGCGGCAGC 1680 

CCAGGCTTAT GCTAAAGAGA AAGGTTTGAC CCCTCCTTCG ACAGACCATC AGGATTCAGG 1740 

AAATACTGAG GCAAAAGGAG CAGAAGCTAT CTACAACCGC GTGAAAGCAG CTAAGAAGGT 1800 

GCCACTTGAT CGTATGCCTT ACAATCTTCA ATATACTGTA GAAGTCAAAA ACGGTAGTTT 1860 

AATCATACCT CATTATGACC ATTACCATAA CATCAAATTT GAGTGGTTTG ACGAAGGCCT 1920 

TTATGAGGCA CCTAAGGGGT ATACTCTTGA GGATCTTTTG GCGACTGTCA AGTACTATGT 1980 

CGAACATCCA AACGAACGTC CGCATTCAGA TAATGGTTTT GGTAACGCTA GCGACCATGT 2040 

TCAAAGAAAC AAAAATGGTC AAGCTGATAC CAATCAAACG GAAAAACCAA GCGAGGAGAA 2100 

ACCTCAGACA GAAAAACCTG AGGAAGAAAC CCCTCGAGAA GAGAAACCGC AAAGCGAGAA 2160 

ACCAGAGTCT CCAAAACCAA CAGAGGAACC AGAAGAATCA CCAGAGGAAT CAGAAGAACC 2220 

TCAGGTCGAG ACTGAAAAGG TTGAAGAAAA ACTGAGAGAG GCTGAAGATT TACTTGGAAA 2280 

AATCCAGGAT 2290 
(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 763 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

Cys Ser Tyr Glu Leu Gly Arg His Gin Ala Gly Gin Val Lys Lys Glu 
1 5 10 15 

Ser Asn Arg Val Ser Tyr lie Asp Gly Asp Gin Ala Gly Gin Lys Ala 
20 25 30 

Glu Asn Leu Thr Pro Asp Glu Val Ser Lys Arg Glu Gly lie Asn Ala 
35 40 45 

Glu Gin Xaa Val lie Lys lie Thr Asp Gin Gly Tyr Val Thr Ser His 
50 55 60 

Gly Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr Asp Ala lie 
65 70 75 80 

lie Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Gin Leu Lys Asp 
85 90 95 

Ser Asp lie Val Asn Glu lie Lys Gly Gly Tyr Val He Lys Val Asn 
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100 105 110 

Gly Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala Asp Asn lie 
115 120 125 

Arg Thr Lys Glu Glu lie Lys Arg Gin Lys Gin Glu Arg Ser His Asn 
130 135 140 

His Asn Ser Arg Ala Asp Asn Ala Val Ala Ala Ala Arg Ala Gin Gly 
145 150 155 160 

Arg Tyr Thr Thr Asp Asp Gly Tyr lie Phe Asn Ala Ser Asp lie lie 
165 170 175 

Glu Asp Thr Gly Asp Ala Tyr lie Val Pro His Gly Asp His Tyr His 
180 185 190 

Tyr lie Pro Lys Asn Glu Leu Ser Ala Ser Glu Leu Ala Ala Ala Glu 
195 200 205 

Ala Tyr Trp Asn Gly Lys Gin Gly Ser Arg Pro Ser Ser Ser Ser Ser 
210 215 220 

Tyr Asn Ala Asn Pro Ala Gin Pro Arg Leu Ser Glu Asn His Asn Leu 
225 230 235 240 

Thr Val Thr Pro Thr Tyr His Gin Asn Gin Gly Glu Asn lie Ser Ser 
245 250 255 

Leu Leu Arg Glu Leu Tyr Ala Lys Pro Leu Ser Glu Arg His Val Glu 
260 265 270 

Ser Asp Gly Leu lie Phe Asp Pro Ala Gin lie Thr Ser Arg Thr Ala 
275 280 285 

Arg Gly Val Ala Val Pro His Gly Asn His Tyr His Phe lie Pro Tyr 
290 295 300 

Glu Gin Met Ser Glu Leu Glu Lys Arg lie Ala Arg lie lie Pro Leu 
305 310 315 320^ 

Arg Tyr Arg Ser Asn His Trp Val Pro Asp Ser Arg Pro Glu Gin Pro 
325 330 335 

Ser Pro Gin Ser Thr Pro Glu Pro Ser Pro Ser Pro Gin Pro Ala Pro 
340 345 350 

Asn Pro Gin Pro Ala Pro Ser Asn Pro lie Asp Glu Lys Leu Val Lys 
355 360 365 

Glu Ala Val Arg Lys Val Gly Asp Gly Tyr Val Phe Glu Glu Asn Gly 
370 375 380 

Val Ser Arg Tyr lie Pro Ala Lys Asp Leu Ser Ala Glu Thr Ala Ala 
385 390 395 400 

Gly lie Asp Ser Lys Leu Ala Lys Gin Glu Ser Leu Ser His Lys Leu 
405 410 415 

Gly Ala Lys Lys Thr Asp Leu Pro Ser Ser Asp Arg Glu Phe Tyr Asn 
420 425 430 

Lys Ala Tyr Asp Leu Leu Ala Arg lie His Gin Asp Leu Leu Asp Asn 
435 440 445 
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Lys Gly Arg Gin Val Asp Phe Glu Ala Leu Asp Asn Leu Leu Glu Arg 
450 455 460 

Leu Lys Asp Val Xaa Ser Asp Lys Val Lys Leu Val Xaa Asp lie Leu 
465 470 475 480 

Ala Phe Leu Ala Pro lie Arg His Pro Glu Arg Leu Gly Lys Pro Asn 
485 490 495 

Ala Gin lie Thr Tyr Thr Asp Asp Glu lie Gin Val Ala Lys Leu Ala 
500 505 510 

Gly Lys Tyr Thr Thr Glu Asp Gly Tyr lie Phe Asp Pro Arg Asp lie 
515 520 525 

Thr Ser Asp Glu Gly Asp Ala Tyr Val Thr Pro His Met Thr His Ser 
530 535 540 

His Trp lie Lys Lys Asp Ser Leu Ser Glu Ala Glu Arg Ala Ala Ala 
545 550 555 560 

Gin Ala Tyr Ala Lys Glu Lys Gly Leu Thr Pro Pro Ser Thr Asp His 
565 570 575 

Gin Asp Ser Gly Asn Thr Glu Ala Lys Gly Ala Glu Ala lie Tyr Asn 
580 585 590 

Arg Val Lys Ala Ala Lys Lys Val Pro Leu Asp Arg Met Pro Tyr Asn 
595 600 605 

Leu Gin Tyr Thr Val Glu Val Lys Asn Gly Ser Leu lie lie Pro His 
610 615 620 

Tyr Asp His Tyr His Asn lie Lys Phe Glu Trp Phe Asp Glu Gly Leu 
625 630 635 640 

Tyr Glu Ala Pro Lys Gly Tyr Thr Leu Glu Asp Leu Leu Ala Thr Val 
645 650 655 

Lys Tyr Tyr Val . Glu His Pro Asn Glu Arg Pro His Ser Asp Asn Gly 
660 665 670 

Phe Gly Asn Ala Ser Asp His Val Gin Arg Asn Lys Asn Gly Gin Ala 
675 680 685 

Asp Thr Asn Gin Thr Glu Lys Pro Ser Glu Glu Lys Pro Gin Thr Glu 
690 695 700 

Lys Pro Glu Glu Glu Thr Pro Arg Glu Glu Lys Pro Gin Ser Glu Lys 
705 710 715 720 

Pro Glu Ser Pro Lys Pro Thr Glu Glu Pro Glu Glu Ser Pro Glu Glu 
725 730 735 

Ser Glu Glu Pro Gin Val Glu Thr Glu Lys Val Glu Glu Lys Leu Arg 
740 745 750 

Glu Ala Glu Asp Leu Leu Gly Lys lie Gin Asp 
755 760 

INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 352 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

TTATAAGGGT GAATTAGAAA AAGGATACCA ATTTGATGGT TGGGAAATTT CTGGTTTCGA 60 

AGGTAAAAAA GACGCTGGCT ATGTTATTAA TCTATCAAAA GATACCTTTA TAAAACCTGT 120 

ATTCAAGAAA ATAGAGGAGA AAAAGGAGGA AGAAAATAAA CCTACTTTTG ATGTATCGAA 180 

AAAGAAAGAT AACCCACAAG TAAACCATAG TCAATTAAAT GAAAGTCACA GAAAAGAGGA 240 

TTTACAAAGA GAAGAGCATT CACAAAAATC TGATTCAACT AAGGATGTTA CAGCTACAGT 300 

TCTTGATAAA AACAATATCA GTAGTAAATC AACTACTAAC AATCCTAATA AG 352 
(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

Tyr Lys Gly Glu Leu Glu Lys Gly Tyr Gin Phe Asp Gly Trp Glu lie 
15 10 15 

Ser Gly Phe Glu Gly Lys Lys Asp Ala Gly Tyr Val lie Asn Leu Ser 
20 25 30 

Lys Asp Thr Phe lie Lys Pro Val Phe Lys Lys lie Glu Glu Lys Lys 
35 40 45 

Glu Glu Glu Asn Lys Pro Thr Phe Asp Val Ser Lys Lys Lys Asp Asn 
50 55 60 

Pro Gin Val Asn His Ser Gin Leu Asn Glu Ser His Arg Lys Glu Asp 
65 70 75 80 

Leu Gin Arg Glu Glu His Ser Gin Lys Ser Asp Ser Thr Lys Asp Val 
85 90 95 

Thr Ala Thr Val Leu Asp Lys Asn Asn lie Ser Ser Lys Ser Thr Thr 
100 105 110 

Asn Asn Pro Asn Lys 
115 

(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1312 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

GAATGTTCAG GCTCAAGAAA GTTCAGGAAA TAAAATCCAC TTTATCAATG TTCAAGAAGG 60 

TGGCAGTGAT GCGATTATTC TTGAAAGCAA TGGACATTTT GCCATGGTGG ATACAGGAGA 120 

AGATTATGAT TTCCCAGATG GAAGTGATTC TCGCTATCCA TGGAGAGAAG GAATTGAAAC 180 

GTCTTATAAG CATGTTCTAA CAGACCGTGT CTTTCGTCGT TTGAAGGAAT TGGGTGTCCA 240 

AAAACTTGAT TTTATTTTGG TGACCCATAC CCACAGTGAT CATATTGGAA ATGTTGATGA 300 

ATTACTGTCT ACCTATCCAG TTGACCGAGT CTATCTTAAG AAATATAGTG ATAGTCGTAT 360 

TACTAATTCT GAACGTCTAT GGGATAATCT GTATGGCTAT GATAAGGTTT TACAGACTGC 420 

TGCAGAAAAA GGTGTTTCAG TTATTCAAAA TATCACACAA GGGGATGCTC ATTTTCAGTT 480 

TGGGGACATG GATATTCAGC TCTATAATTA TGAAAATGAA ACTGATTCAT CGGGTGAATT 540 

AAAGAAAATT TGGGATGACA ATTCCAATTC CTTGATTAGC GTGGTGAAAG TCAATGGCAA 600 

GAAAATTTAC CTTGGGGGCG ATTTAGATAA TGTTCATGGA GCAGAAGACA AGTATGGTCC 660 

TCTCATTGGA AAAGTTGATT TGATGAAGTT TAATCATCAC CATGATACCA ACAAATCAAA 720 

TACCAAGGAT TTCATTAAAA ATTTGAGTCC GAGTTTGATT GTTCAAACTT CGGATAGTCT 780 

ACCTTGGAAA AATGGTGTTG ATAGTGAGTA TGTTAATTGG CTCAAAGAAC GAGGAATTGA 840 

GAGAATCAAC GCAGCCAGCA AAGACTATGA TGCAACAGTT TTTGATATTC GAAAAGACGG 900 

TTTTGTCAAT ATTTCAACAT CCTACAAGCC GATTCCAAGT TTTCAAGCTG GTTGGCATAA 960 

GAGTGCATAT GGGAACTGGT GGTATCAAGC GCCTGATTCT ACAGGAGAGT ATGCTGTCGG 1020 

TTGGAATGAA ATCGAAGGTG AATGGTATTA CTTTAACCAA ACGGGTATCT TGTTACAGAA 1080 

TCAATGGAAA AAATGGAACA ATCATTGGTT CTATTTGACA GACTCTGGTG CTTCTGCTAA 1140 

AAATTGGAAG AAAATCGCTG GAATCTGGTA TTATTTTAAC AAAGAAAACC AGATGGAAAT 1200 

TGGTTGGATT CAAGATAAAG AGCAGTGGTA TTATTTGGAT GTTGATGGTT CTATGAAGAC 1260 

AGGATGGCTT CAATATATGG GGCAATGGTA TTACTTTGCT CCATCAGGGG AA 1312 
(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 437 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

Asn Val Gin Ala Gin Glu Ser Ser Gly Asn Lys lie His Phe lie Asn 
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15 10 15 

Val Gin Glu Gly Gly Ser Asp Ala lie lie Leu Glu Ser Asn Gly Hi's 
20 25 30 

Phe Ala Met Val Asp Thr Gly Glu Asp Tyr Asp Phe Pro Asp Gly Ser 
35 40 45 

Asp Ser Arg Tyr Pro Trp Arg Glu Gly lie Glu Thr Ser Tyr Lys His 
50 55 60 

Val Leu Thr Asp Arg Val Phe Arg Arg Leu Lys Glu Leu Gly Val Gin 
65 70 75 80 

Lys Leu Asp Phe lie Leu Val Thr His Thr His Ser Asp His lie Gly 
85 90 95 

Asn Val Asp Glu Leu Leu Ser Thr Tyr Pro Val Asp Arg Val Tyr Leu 
100 105 110 

Lys Lys Tyr Ser Asp Ser Arg lie Thr Asn Ser Glu Arg Leu Trp Asp 
115 120 125 

Asn Leu Tyr Gly Tyr Asp Lys Val Leu Gin Thr Ala Ala Glu Lys Gly 
130 135 140 

Val Ser Val lie Gin Asn lie Thr Gin Gly Asp Ala His Phe Gin Phe 
145 150 155 160 

Gly Asp Met Asp lie Gin Leu Tyr Asn Tyr Glu Asn Glu Thr Asp Ser 
165 170 175 

Ser Gly Glu Leu Lys Lys lie Trp Asp Asp Asn Ser Asn Ser. Leu lie 
180 185 190 

Ser Val Val Lys Val Asn Gly Lys Lys lie Tyr Leu Gly Gly Asp Leu 
195 200 205 

Asp Asn Val His Gly Ala Glu Asp Lys Tyr Gly Pro Leu lie Gly Lys 
210 215 220 

Val Asp Leu Met Lys Phe Asn His His His Asp Thr Asn Lys Ser Asn 
225 230 235 240 

Thr Lys Asp Phe lie Lys Asn Leu Ser Pro Ser Leu lie Val Gin Thr 
245 250 255 

Ser Asp Ser Leu Pro Trp Lys Asn Gly Val Asp Ser Glu Tyr Val Asn 
260 265 270 

Trp Leu Lys Glu Arg Gly lie Glu Arg lie Asn Ala Ala Ser Lys Asp 
275 280 285 

Tyr Asp Ala Thr Val Phe Asp lie Arg Lys Asp Gly Phe Val Asn lie 
290 295 300 

Ser Thr Ser Tyr Lys Pro lie Pro Ser Phe Gin Ala Gly Trp His Lys 
305 310 315 320 

Ser Ala Tyr Gly Asn Trp Trp Tyr Gin Ala Pro Asp Ser Thr Gly Glu 
325 330 335 

Tyr Ala Val Gly Trp Asn Glu lie Glu Gly Glu Trp Tyr Tyr Phe Asn 
340 345 350 



f 
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Gin Thr Gly lie Leu Leu Gin Asn Gin Trp Lys Lys Trp Asn Asn His 
355 360 365 

Trp Phe Tyr Leu Thr Asp Ser Gly Ala Ser Ala Lys Asn Trp Lys Lys 
370 375 380 

lie Ala Gly lie Trp Tyr Tyr Phe Asn Lys Glu Asn Gin Met Glu lie 
385 390 395 400 

Gly Trp lie Gin Asp Lys Glu Gin Trp Tyr Tyr Leu Asp Val Asp Gly 
405 410 415 

Ser Met Lys Thr Gly Trp Leu Gin Tyr Met Gly Gin Trp Tyr Tyr Phe 
420 425 430 

Ala Pro Ser Gly Glu 
435 

(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1855 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

CTTGGGTGTA ACCCATATCC AGCTCCTTCC AGTCTTGTCT TACTACTTTG TCAATGAATT 60 

GAAAAACCAT GAACGCTTGT CTGACTACGC TTCAAGCAAC AGCAACTACA ACTGGGGATA 120 

TGACCCTCAA AACTACTTCT CCTTGACTGG TATGTACTCA AGCGATCCTA AGAATCCAGA 180 

AAAACGAATC GCAGAATTTA AAAACCTCAT CAACGAAATC CACAAACGTG GTATGGGAGC 240 

TATCCTAGAT GTCGTTTATA ACCACACAGC CAAAGTCGAT CTCTTTGAAG ATTTGGAACC 300 

AAACTACTAC CACTTTATGG ATGCCGATGG CACACCTCGA ACTAGCTTTG GTGGTGGACG 360 

CTTGGGGACA ACCCACCATA TGACCAAACG GCTCCTAATT GACTCTATCA AATACCTAGT 420 

TGATACCTAC AAAGTGGATG GCTTCCGTTT CGATATGATG GGAGACCATG ACGCCGCTTC 480 

TATCGAAGAA GCTTACAAGG CTGCACGCGC CCTCAATCCA AACCTCATCA TGCTTGGTGA 540 

AGGTTGGAGA ACCTATGCCG GTGATGAAAA CATGCCTACT AAAGCTGCTG ACCAAGATTG 600 

GATGAAACAT ACCGATACTG TCGCTGTCTT TTCAGATGAC ATCCGTAACA ACCTCAAATC 660 

TGGTTATCCA AACGAAGGTC AACCTGCCTT TATCACAGGT GGCAAGCGTG ATGTCAACAC 720 

CATCTTTAAA AATCTCATTG CTCAACCAAC TAACTTTGAA GCTGACAGCC CTGGAGATGT 780 

CATCCAATAC ATCGCAGCCC ATGATAACTT GACCCTCTTT GACATCATTG CCCAGTCTAT 840 

CAAAAAAGAC CCAAGCAAGG CTGAGAACTA TGCTGAAATC CACCGTCGTT TACGACTTGG 900 

AAATCTCATG GTCTTGACAG CTCAAGGAAC TCCATTTATC CACTCCGGTC AGGAATATGG 960 

ACGTACTAAA CAATTCCGTG ACCCAGCCTA CAAGACTCCA GTAGCAGAGG ATAAGGTTCC 1020 
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AAACAAATCT CACTTGTTGC GTGATAAGGA CGGCAACCCA TTTGACTATC CTTACTTCAT 1080 

CCATGACTCT TACGATTCTA GTGATGCAGT CAACAAGTTT GACTGGACTA AGGCTACAGA 1140 

TGGTAAAGCT TATCCTGAAA ATGTCAAGAG CCGTGACTAT ATGAAAGGTT TGATTGCCCT 1200 

TCGTCAATCT ACAGATGCCT TCCGACTTAA GAGTCTTCAA GATATCAAAG ACCGTGTCCA 1260 

CCTCATCACT GTCCCAGGCC AAAATGGTGT GGAAAAAGAG GATGTAGTGA TTGGCTACCA 1320 

AATCACTGCT CCAAACGGCG ATATCTACGC AGTCTTTGTC AATGCGGATG AAAAAGCTCG 1380 

CGAATTTAAT TTGGGAACTG CCTTTGCACA TCTAAGAAAT GCGGAAGTTT TGGCAGATGA 1440 

AAACCAAGCA GGACCAGTCG GAATTGCCAA CCCGAAAGGA CTTGAATGGA CTGAAAAAGG 1500 

CTTGAAATTG AATGCCCTTA CAGCTACTGT TCTTCGAGTC TCTCAAAATG GAACTAGCCA 1560 

TGAGTCAACT GCAGAAGAGA AACCAGACTC AACCCCTTCC AAGCCTGAAC ATCAAAATGA 1620 

AGCTTCTCAC CCTGCACATC AAGACCCAGC TCCAGAAGCT AGACCTGATT CTACTAAACC 1680 

AGATGCCAAA GTAGCTGATG CGGAAAATAA ACCTAGCCAA GCTACAGCTG ATTCACAAGC 1740 

TGAACAACCA GCACAAGAAG CACAAGCATC ATCTGTAAAA GAAGCGGTTC GAAACGAATC 1800 

GGTAGAAAAC TCTAGCAAGG AAAATATACC TGCAACCCCA GATAAACAAG CTGAA 1855 
(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 618 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

Leu Gly Val Thr His lie Gin Leu Leu Pro Val Leu Ser Tyr Tyr Phe 
15 10 15 

Val Asn Glu Leu Lys Asn His Glu Arg Leu Ser Asp Tyr Ala Ser Ser 
20 25 30 

Asn Ser Asn Tyr Asn Trp Gly Tyr Asp Pro Gin Asn Tyr Phe Ser Leu 
35 40 45 

Thr Gly Met Tyr Ser Ser Asp Pro Lys Asn Pro Glu Lys Arg lie Ala 
50 55 60 

Glu Phe Lys Asn Leu lie Asn Glu lie His Lys Arg Gly Met Gly Ala 
65 70 75 80 

lie Leu Asp Val Val Tyr Asn His Thr Ala Lys Val Asp Leu Phe Glu 
85 90 95 

Asp Leu Glu Pro Asn Tyr Tyr His Phe Met Asp Ala Asp Gly Thr Pro 
100 105 110 

Arg Thr Ser Phe Gly Gly Gly Arg Leu Gly Thr Thr His His Met Thr 
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115 120 125 

Lys Arg Leu Leu lie Asp Ser lie Lys Tyr Leu Val Asp Thr Tyr Lys 
130 135 140 

Val Asp Gly Phe Arg Phe Asp Met Met Gly Asp His Asp Ala Ala Ser 
145 150 155 160 

lie Glu Glu Ala Tyr Lys Ala Ala Arg Ala Leu Asn Pro Asn Leu lie 
165 170 175 

Met Leu Gly Glu Gly Trp Arg Thr Tyr Ala Gly Asp Glu Asn Met Pro 
180 185 190 

Thr Lys Ala Ala Asp Gin Asp Trp Met Lys His Thr Asp Thr Val Ala 
195 200 205 

Val Phe Ser Asp Asp lie Arg Asn Asn Leu Lys Ser Gly Tyr Pro Asn 
210 215 220 

Glu Gly Gin Pro Ala Phe lie Thr Gly Gly Lys Arg Asp Val Asn Thr 
225 230 235 240 

lie Phe Lys Asn Leu lie Ala Gin Pro Thr Asn Phe Glu Ala Asp Ser 
245 250 255 

Pro Gly Asp Val lie Gin Tyr lie Ala Ala His Asp Asn Leu Thr Leu 
260 265 270 

Phe Asp lie lie Ala Gin Ser lie Lys Lys Asp Pro Ser Lys Ala Glu 
275 280 285 

Asn Tyr Ala Glu lie His Arg Arg Leu Arg Leu Gly Asn Leu Met Val 
290 295 300 

Leu Thr Ala Gin Gly Thr Pro Phe lie His Ser Gly Gin Glu Tyr Gly 
305 310 315 320 

Arg Thr Lys Gin Phe Arg Asp Pro Ala Tyr Lys Thr Pro Val Ala Glu 
325 330 335 

Asp Lys Val Pro Asn Lys Ser His Leu Leu Arg Asp Lys Asp Gly Asn 
340 345 350 

Pro Phe Asp Tyr Pro Tyr Phe lie His Asp Ser Tyr Asp Ser Ser Asp 
355 360 365 

Ala Val Asn Lys Phe Asp Trp Thr Lys Ala Thr Asp Gly Lys Ala Tyr 
370 375 380 

Pro Glu Asn Val Lys Ser Arg Asp Tyr Met Lys Gly Leu lie Ala Leu 
385 390 395 400 

Arg Gin Ser Thr Asp Ala Phe Arg Leu Lys Ser Leu Gin Asp lie Lys 
405 410 415 

Asp Arg Val His Leu lie Thr Val Pro Gly Gin Asn Gly Val Glu Lys 
420 425 430 

Glu Asp Val Val lie Gly Tyr Gin lie Thr Ala Pro Asn Gly Asp lie 
435 440 445 

Tyr Ala Val Phe Val Asn Ala Asp Glu Lys Ala Arg Glu Phe Asn Leu 
450 455 460 
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Gly Thr Ala Phe Ala His Leu Arg Asn Ala Glu Val Leu Ala Asp Glu 
465 470 475 480 

Asn Gin Ala Gly Pro Val Gly lie Ala Asn Pro Lys Gly Leu Glu Trp 
485 490 495 

Thr Glu Lys Gly Leu Lys Leu Asn Ala Leu Thr Ala Thr Val Leu Arg 
500 505 510 

Val Ser Gin Asn Gly Thr Ser His Glu Ser Thr Ala Glu Glu Lys Pro 
515 520 525 

Asp Ser Thr Pro Ser Lys Pro Glu His Gin Asn Glu Ala Ser His Pro 
530 535 540 

Ala His Gin Asp Pro Ala Pro Glu Ala Arg Pro Asp Ser Thr Lys Pro 
545 550 555 560 

Asp Ala Lys Val Ala Asp Ala Glu Asn Lys Pro Ser Gin Ala Thr Ala 
565 570 575 

Asp Ser Gin Ala Glu Gin Pro Ala Gin Glu Ala Gin Ala Ser Ser Val 
580 585 590 

Lys Glu Ala Val Arg Asn Glu Ser Val Glu Asn Ser Ser Lys Glu Asn 
595 600 605 

lie Pro Ala Thr Pro Asp Lys Gin Ala Glu 
610 615 

(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1774 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

TAGTGATGGT ACTTGGCAAG GAAAACAGTA TCTGAAAGAA GATGGCAGTC AAGCAGCAAA 60 

TGAGTGGGTT TTNGATACTC ATTATCAATC TTGGTTCTAT ATAAAAGCAG ATGCTAACTA 120 

TGCTGAAAAT GAATGGCTAA AGCAAGGTGA CGACTATTTT TACCTCAAAT CTGGTGGCTA 180 

TATGGCCAAA TCAGAATGGG TAGAAGACAA GGGAGCCTTT TATTATCTTG ACCAAGATGG 240 

AAAGATGAAA AGAAATGCTT GGGTAGGAAC TTCCTATGTT GGTGCAACAG GTGCCAAAGT 300 

AATAGAAGAC TGGGTCTATG ATTCTCAATA CGATGCTTGG TTTTATATCA AAGCAGATGG 360 

ACAGCACGCA GAGAAAGAAT GGCTCCAAAT TAAAGGGAAG GACTATTATT TCAAATCCGG 420 

TGGTTATCTA CTGACAAGTC AGTGGATTAA TCAAGCTTAT GTGAATGCTA GTGGTGCCAA 480 

AGTACAGCAA GGTTGGCTTT TTGACAAACA ATACCAATCT TGGTTTTACA TCAAAGAAAA 540 

TGGAAACTAT GCTGATAAAG AATGGATTTT CGAGAATGGT CACTATTATT ATCTAAAATC 600 

CGGTGGCTAC ATGGCAGCCA ATGAATGGAT TTGGGATAAG GAATCTTGGT TTTATCTCAA 660 
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ATTTGATGGG 


AAAATGGCTG 


AAAAAGAATG 


GGTCTACGAT 


TCTCATAGTC 


AAGCTTGGTA 


720 


CTACTTCAAA 


TCCGGTGGTT 


ACATGACAGC 


CAATGAATGG 


ATTTGGGATA 


AGGAATCTTG 


780 


GTTTTACCTC 


AAATCTGATG 


GGAAAATAGC 


TGAAAAAGAA 


TGGGTCTACG 


ATTCTCATAG 


840 


TCAAGCTTGG 


TACTACTTCA 


AATCTGGTGG 


CTACATGGCG 


AAAAATGAGA 


CAGTAGATGG 


900 


TTATCAGCTT 


GGAAGCGATG 


GTAAATGGCT 


TGGAGGAAAA 


ACTACAAATG 


AAAATGCTGC 


960 


TTACTATCAA 


GTAGTGCCTG 


TTACAGCCAA 


TGTTTATGAT 


TCAGATGGTG 


AAAAGCTTTC 


1020 


CTATATATCG 


CAAGGTAGTG 


TCGTATGGCT 


AGATAAGGAT 


AGAAAAAGTG 


ATGACAAGCG 


1080 


CTTGGCTATT 


ACTATTTCTG 


GTTTGTCAGG 


CTATATGAAA 


ACAGAAGATT 


TACAAGCGCT 


1140 


AGATGCTAGT 


AAGGACTTTA 


TCCCTTATTA 


TGAGAGTGAT 


GGCCACCGTT 


TTTATCACTA 


1200 


TGTGGCTCAG 


AATGCTAGTA 


TCCCAGTAGC 


TTCTCATCTT 


TCTGATATGG 


AAGTAGGCAA 


1260 


GAAATATTAT 


TCGGCAGATG 


GCCTGCATTT 


TGATGGTTTT 


AAGCTTGAGA 


ATCCCTTCCT 


1320 


TTTCAAAGAT 


TTAACAGAGG 


CTACAAACTA 


CAGTGCTGAA 


GAATTGGATA 


AGGTATTTAG 


1380 


TTTGCTAAAC 


ATTAACAATA 


GCCTTTTGGA 


GAACAAGGGC 


GCTACTTTTA 


AGGAAGCCGA 


1440 


AGAACATTAC 


CATATCAATG 


CTCTTTATCT 


CCTTGCCCAT 


AGTGCCCTAG 


AAAGTAACTG 


1500 


GGGAAGAAGT 


AAAATTGCCA 


AAGATAAGAA 


TAATTTCTTT 


GGCATTACAG 


CCTATGATAC 


1560 


GACCCCTTAC 


CTTTCTGCTA 


AGACATTTGA 


TGATGTGGAT 


AAGGGAATTT 


TAGGTGCAAC 


1620 


CAAGTGGATT 


AAGGAAAATT 


ATATCGATAG 


GGGAAGAAGT 


TTCCTTGGAA 


ACAAGGCTTC 


1680 


TGGTATGAAT 


GTGGAATATG 


CTTCAGACCC 


TTATTGGGGC 


GAAAAAATTG 


CTAGTGTGAT 


1740 


GATGAAAATC 


AATGAGAAGC 


TAGGTGGCAA 


AGAT 






1774 


(2) INFORMATION FOR SEQ ID NO: 74; 











(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 591 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

Ser Asp Gly Thr Trp Gin Gly Lys Gin Tyr Leu Lys Glu Asp Gly Ser 
15 10 15 

Gin Ala Ala Asn Glu Trp Val Xaa Asp Thr His Tyr Gin Ser Trp Phe 
20 25 30 

Tyr lie Lys Ala Asp Ala Asn Tyr Ala Glu Asn Glu Trp Leu Lys Gin 
35 40 45 

Gly Asp Asp Tyr Phe Tyr Leu Lys Ser Gly Gly Tyr Met Ala Lys Ser 
50 55 60 
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Glu Trp Val Glu Asp Lys Gly Ala Phe Tyr Tyr Leu Asp Gin Aspi Gly 
65 70 75 80 

Lys Met Lys Arg Asn Ala Trp Val Gly Thr Ser Tyr Val Gly Ala Thr 
85 90 95 

Gly Ala Lys Val lie Glu Asp Trp Val Tyr Asp Ser Gin Tyr Asp Ala 
100 105 110 

Trp Phe Tyr lie Lys Ala Asp Gly Gin His Ala Glu Lys Glu Trp Leu 
115 120 125 

Gin lie Lys Gly Lys Asp Tyr Tyr Phe Lys Ser Gly Gly Tyr Leu Leu 
130 135 140 

Thr Ser Gin Trp lie Asn Gin Ala Tyr Val Asn Ala Ser Gly Ala Lys 
145 150 155 160 

Val Gin Gin Gly Trp Leu Phe Asp Lys Gin Tyr Gin Ser Trp Phe Tyr 
165 170 175 

lie Lys Glu Asn Gly Asn Tyr Ala Asp Lys Glu Trp lie Phe Glu Asn 
180 185 190 

Gly His Tyr Tyr Tyr Leu Lys Ser Gly Gly Tyr Met Ala Ala Asn Glu 
195 200 205 

Trp lie Trp Asp Lys Glu Ser Trp Phe Tyr Leu Lys Phe Asp Gly Lys 
210 215 220 

Met Ala Glu Lys Glu Trp Val Tyr Asp Ser His Ser Gin Ala Trp Tyr 
225 230 235 240 

Tyr Phe Lys Ser Gly Gly Tyr Met Thr Ala Asn Glu Trp lie Trp Asp 
245 250 255 

Lys Glu Ser Trp Phe Tyr Leu Lys Ser Asp Gly Lys lie Ala Glu Lys 
260 265 270 

Glu Trp Val Tyr Asp Ser His Ser Gin Ala Trp Tyr Tyr Phe Lys Ser 
275 280 285 

Gly Gly Tyr Met Ala Lys Asn Glu Thr Val Asp Gly Tyr Gin Leu Gly 
290 295 300 

Ser Asp Gly Lys Trp Leu Gly Gly Lys Thr Thr Asn Glu Asn Ala Ala 
305 310 315 320 

Tyr Tyr Gin Val Val Pro Val Thr Ala Asn Val Tyr Asp Ser Asp Gly 
325 330 335 

Glu Lys Leu Ser Tyr lie Ser Gin Gly Ser Val Val Trp Leu Asp Lys 
340 345 350 

Asp Arg Lys Ser Asp Asp Lys Arg Leu Ala lie Thr lie Ser Gly Leu 
355 360 365 

Ser Gly Tyr Met Lys Thr Glu Asp Leu Gin Ala Leu Asp Ala Ser Lys 
370 375 380 

Asp Phe lie Pro Tyr Tyr Glu Ser Asp Gly His Arg Phe Tyr His Tyr 
385 390 395 400 

Val Ala Gin. Asn Ala Ser lie Pro Val Ala Ser His Leu Ser Asp Met 
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405 410 415 

Glu Val Gly Lys Lys Tyr Tyr Ser Ala Asp Gly Leu His Phe Asp Gly 
420 425 430 

Phe Lys Leu Glu Asn Pro Phe Leu Phe Lys Asp Leu Thr Glu Ala Thr 
435 440 445 

Asn Tyr Ser Ala Glu Glu Leu Asp Lys Val Phe Ser Leu Leu Asn lie 
450 455 460 

Asn Asn Ser Leu Leu Glu Asn Lys Gly Ala Thr Phe Lys Glu Ala Glu 
465 470 475 480 

Glu His Tyr His lie Asn Ala Leu Tyr Leu Leu Ala His Ser Ala Leu 
485 490 495 

Glu Ser Asn Trp Gly Arg Ser Lys lie Ala Lys Asp Lys Asn Asn Phe 
500 505 510 

Phe Gly lie Thr Ala Tyr Asp Thr Thr Pro Tyr Leu Ser Ala Lys Thr 
515 520 525 

Phe Asp Asp Val Asp Lys Gly lie Leu Gly Ala Thr Lys Trp lie Lys 
530 535 540 . 

Glu Asn Tyr lie Asp Arg Gly Arg Thr Phe Leu Gly Asn Lys Ala Ser 
545 550 555 , 560 

Gly Met Asn Val Glu Tyr Ala Ser Asp Pro Tyr Trp Gly Glu Lys lie 
565 570 575 

Ala Ser Val Met Met Lys lie Asn Glu Lys Leu Gly Gly Lys Asp 
580 585 590 

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1105 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

TGGGATTCAA TATGTCAGAG ATGATACTAG AGATAAAGAA GAGGGAATAG AGTATGATGA ^ 60 

CGCTGACAAT GGGGATATTA TTGTAAAAGT AGCGACTAAA CCTAAGGTAG TAACCAAGAA 120 

AATTTCAAGT ACGCGAATTC GTTATGAAAA AGATGAAACA AAAGACCGTA GTGAAAATCC 180 

TGTTACAATT GATGGAGAGG ATGGCTATGT AACTACGACA AGGACCTACG ATGTTAATCC 240 

AGAGACTGGT TATGTTACCG AACAGGTTAC TGTTGATAGA AAAGAAGCCA CGGATACAGT 300 

TATCAAAGTT CCAGCTAAAA GCAAGGTTGA AGAAGTTCTT GTTCCATTTG CTACTAAATA 360 

TGAAGCAGAC AATGACCTTT CTGCAGGACA GGAGCAAGAG ATTACTCTAG GAAAGAATGG 420 

GAAAACAGTT ACAACGATAA CTTATAATGT AGATGGAAAG AGTGGACAAG TAACTGAGAG 480 

TACTTTAAGT CAAAAAAAAG ACTCtCAAAC AAGAGTTGTT AAAAAAAGaA CCArkCCCCA 540 
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AGTTCTTGTC 


CAAGAAATTC 


CAATCGAAAC 


AGAATATCTG 


GATGGCCCaA 


CTCTTGATAA 


600 


AaGTCAAGAA 


GTAGAAGAAG 


TAGGAGAAAT 


TGGTAAATTA 


CTCTTACTAC 


AATCTATACT 


660 


GGTAGATGAA 


CGTGATGGAA 


CAATTGAAGA 


AACTACTTCT 


CGTCAAATTA 


CTAAAGAGAT 


720 


GGTAAAAAGA 


CGTATAAGGA 


GAGGGACGAG 


AGAACCTGAA 


AAAGTTGTTG 


TTCCTGAGCA 


780 


ATCATCTATT 


CCTTCGTATC 


CTGTATCTGT 


TACATCTAAC 


CAAGGAACAG 


ATGTAGCAGT 


840 


AGAACCAGCT 


AAAGCAGTTG 


CTCCAACAAC 


AGACTGGAAA 


CAAGAAAATG 


GTATGTGGTA 


900 


TTTTTATAAT 


ACTGATGGTT 


CCATGGCAAC 


AGGTTGGGTA 


CAAGTTAATA 


GTTCATGGTA 


960 


CTACCTCAAC 


AGCAACGGTT 


CTATGAAAGT 


CAATCAATGG 


TTCCAAGTTG 


GTGGTAAATG 


1020 


GTATTATGTA 


AATACATCGG 


GTGAGTTAGC 


GGTCAATACA 


AGTATAGATG 


GCTATAGAGT 


1080 


CAATGATAAT 


GGTGAATGGG 


TGCGT 








1105 



■4 



(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 368 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

Gly lie Gin Tyr Val Arg Asp Asp Thr Arg Asp Lys Glu Glu Gly lie 
15 10 15 

Glu Tyr Asp Asp Ala Asp Asn Gly Asp lie lie Val Lys Val Ala Thr 
20 25 30 

Lys Pro Lys Val Val Thr Lys Lys lie Ser Ser Thr Arg lie Arg Tyr 
35 40 45 

Glu Lys Asp Glu Thr Lys Asp Arg Ser Glu Asn Pro Val Thr lie Asp 
50 55 60 

Gly Glu Asp Gly Tyr Val Thr Thr Thr Arg Thr Tyr Asp Val Asn Pro 
65 70 75 80 

Glu Thr Gly Tyr Val Thr Glu Gin Val Thr Val Asp Arg Lys Glu Ala 
85 90 95 

Thr Asp Thr Val lie Lys Val Pro Ala Lys Ser Lys Val Glu Glu Val 
100 105 110 

Leu Val Pro Phe Ala Thr Lys Tyr Glu Ala Asp Asn Asp Leu Ser Ala 
115 120 125 

Gly Gin Glu Gin Glu lie Thr Leu Gly Lys Asn Gly Lys Thr Val Thr 
130 135 140 

Thr He Thr Tyr Asn Val Asp Gly Lys Ser Gly Gin Val Thr Glu Ser 
145 150 155 160 
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Thr Leu Ser Gin Lys Lys Asp Ser Gin Thr Arg Val Val Lys Lys Arg 
165 170 175 

Thr Xaa Pro Gin Val Leu Val Gin Glu He Pro He Glu Thr Glu Tyr 
180 185 190 

Leu Asp Gly Pro Thr Leu Asp Lys Ser Gin Glu Val Glu Glu Val Gly 
195 200 205 

Glu He Gly Lys Leu Leu Leu Leu Gin Ser He Leu Val Asp Glu Arg 
210 215 220 

Asp Gly Thr He Glu Glu Thr Thr Ser Arg Gin He Thr Lys Glu Met 
225 230 235 240 

Val Lys Arg Arg He Arg Arg Gly Thr Arg Glu Pro Glu Lys Val Val 
245 250 255 

Val Pro Glu Gin Ser Ser He Pro Ser Tyr Pro Val Ser Val Thr Ser 
260 265 270 

Asn Gin Gly Thr Asp Val Ala Val Glu Pro Ala Lys Ala Val Ala Pro 
275 280 285 

Thr Thr Asp Trp Lys Gin Glu Asn Gly Met Trp Tyr Phe Tyr Asn Thr 
290 295 300 

Asp Gly Ser Met Ala Thr Gly Trp Val Gin Val Asn Ser Ser Trp Tyr 
305 310 315 320 

Tyr Leu Asn Ser Asn Gly Ser Met Lys Val Asn Gin Trp Phe Gin Val 
325 330 335 

Gly Gly Lys Trp Tyr Tyr Val Asn Thr Ser Gly Glu Leu Ala Val Asn 
340 345 350 

Thr Ser He Asp Gly Tyr Arg Val Asn Asp Asn Gly Glu Trp Val Arg 
355 360 365 

(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 661 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

GGATAATAGA GAAGCATTAA AAACCTTTAT GACGGGTGAA AATTTTTATC TCCAACATTA 60 

TCTAGGAGCA CATAGGGAAG AACTAAATGG AGAGCATGGC TATACCTTCC GTGTTTGGGC 120 

ACCTAATGCT CAGGCTGTTC ACTTGGTTGG TGATTTTACC AACTGGATTG AAAATCAGAT 180 

TCCAATGGTA AGAAATGATT TTGGGGTCTG GGAAGTCTTT ACCAATATGG CTCAAGAAGG 240 

GCATATTTAC AAATATCATG TCACACGTCA AAATGGTCAT CAACTGATGA AGATTGACCC 300 

TTTTGCTGTC AGGTATGAGG CTCGTCCAGG AACAGGGGCA ATCGTAACAG AGCTTCCTGA 360 

GAAGAAATGG AAGGATGGAC TTTGGCTGGC ACGAAGAAAA CGTTGGGGCT TTGAAGAGCG 420 
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TCCTGTCAAT ATTTATGAAG TTCACGCTGG ATCATGGAAA AGAAATTCTG ATGGCAGTCC 



480 



TTATAGTTTT GCCCAGCTCA AGGATGAACT CATTCCTTAT CTCGTTGAAA TGAACTATAC 



540 



TCATATTGAG TTTATGCCCT TGATGTCCCA TCCTTTGGGC TTGAGTTGGG GGTATCAGCT 



600 



TATGGGTTAC TTCGCTTTAG AGCATGCTTA TGGCCGACCA GAGGAGTTTC AAGATTTTGT 



660 



C 



661 



(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 220 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

Asp Asn Arg Glu Ala Leu Lys Thr Phe Met Thr Gly Glu Asn Phe Tyr 
15 10 15 

Leu Gin His Tyr Leu Gly Ala His Arg Glu Glu Leu Asn Gly Glu His 
20 25 30 

Gly Tyr Thr Phe Arg Val Trp Ala Pro Asn Ala Gin Ala Val His Leu 
35 40 45 

Val Gly Asp Phe Thr Asn Trp lie Glu Asn Gin lie Pro Met Val Arg 
50 55 60 

Asn Asp Phe Gly Val Trp Glu Val Phe Thr Asn Met Ala Gin Glu Gly 
65 70 75 80 

His lie Tyr Lys Tyr His Val Thr Arg Gin Asn Gly His Gin Leu Met 
85 90 95 

Lys lie Asp Pro Phe Ala Val Arg Tyr Glu Ala Arg Pro Gly Thr Gly 
100 105 110 

Ala lie Val Thr Glu Leu Pro Glu Lys Lys Trp Lys Asp Gly Leu Trp 
115 120 125 

Leu Ala Arg Arg Lys Arg Trp Gly Phe Glu Glu Arg Pro Val Asn lie 
130 135 140 

Tyr Glu Val His Ala Gly Ser Trp Lys Arg Asn Ser Asp Gly Ser Pro 
145 150 155 160 

Tyr Ser Phe Ala Gin Leu Lys Asp Glu Leu lie Pro Tyr Leu Val Glu 
165 170 175 

Met Asn Tyr Thr His lie Glu Phe Met Pro Leu Met Ser His Pro Leu 
180 185 190 

Gly Leu Ser Trp Gly Tyr Gin Leu Met Gly Tyr Phe Ala Leu Glu His 
195 200 205 

Ala Tyr Gly Arg Pro Glu Glu Phe Gin Asp Phe Val 
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210 215 220 

(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 976 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 



AGATTTTGTC 


GAGGAGTGTC 


ATACCCATAA 


TATTGGGGTT 


ATTGTGGACT 


GGGTACCAGN 


60 


TCACTTTACC 


ATCAACGATG 


ATGCCTTAGC 


CTATTATGAT 


GGGACACCGA 


CTTTTGAATA 


120 


CCAAGACCAT 


AATAAGGCTC 


ATAACCATGG 


TTGGGGTGCC 


CTTAATTTTG 


ACCTTGGAAA 


180 


AAATGAAGTC 


CAGTCCTTCT 


TAATTTCTTG 


CATTAAGCAT 


TGGATTGATG 


TCTATCATTT 


240 


GGATGGTATT 


CGTGTGGATG 


CTGTTAGCAA 


CATGCTCTAT 


TTGGACTATG 


ATGATGCTCC 


300 


ATGGACACCT 


AATAAAGATG 


GCGGAAATCT 


CAACTATGAA 


GGTTATTATT 


TCCTTCAGCG 


360 


CTTGAATGAG 


GTTATTAAGT 


TAGAATATCC 


AGATGTGATG 


ATGATTGCAG 


AAGAAAGTTC 


420 


GTCTGCGATC 


AAGATTACGG 


GAATGAAAGA 


GATTGGTGGT 


CTAGGATTTG 


ACTACAAATG 


480 


GAACATGGGC 


TGGATGAATG 


ATATCCTCCG 


TTTCTACGAA 


GAAGATCCGA 


TCTATCGTAA 


540 


ATATGACTTT 


AACCTGGTGA 


CTTTCAGCTT 


TATGTATGTT 


TNCAAGGAGA 


ATTATCTCTT 


600 


GCCATTCTCG 


CACGATGAAG 


TGGTTCATGG 


CAAGAAGAGT 


ATGATGCATA 


AGATGTGGGG 


660 


AGATCGTTAC 


AATCAATTCG 


CAGGCTTGCG 


CAATCTCTAT 


ACGTACCAAA 


TTTGTCACCC 


720 


TGGTAAGAAA 


TTGCTCTTCA 


TGGGTAGCGA 


ATACGGTCAA 


TTCCTAGAAT 


GGAAATCTGA 


780 


AGAACAGTTG 


GAATGGTCTA 


ACCTAGAAGA 


CCCAATGAAT 


GCTAAGATGA 


AGTATTTCGC 


840 


TTCTCAGCTA 


AACCAGTTTT 


ACAAAGATCA 


TCGCTGTCTG 


TGGGAAATTG 


ATACCAGCTA 


900 


TGATGGTATT 


GAAATCATTG 


ATGCGGATAA 


TCGAGACCAG 


AGTGTTCTTT 


CCTTTATTCG 


960 


TAAGGGTAAA 


AAGGGA 










976 


(2) INFORMATION FOR SEQ ID NO: 80; 











(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 325 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

Asp Phe Val Glu Glu Cys His Thr His Asn lie Gly Val lie Val Asp 
1 5 10 15 
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Trp Val Pro Xaa His Phe Thr lie Asn Asp Asp Ala Leu Ala Tyr Tyr 
20 25 30 

Asp Gly Thr Pro Thr Phe Glu Tyr Gin Asp His Asn Lys Ala His Asn 
35 40 45 

His Gly Trp Gly Ala Leu Asn Phe Asp Leu Gly Lys Asn Glu Val Gin 
50 55 60 

Ser Phe Leu lie Ser Cys He Lys His Trp He Asp Val Tyr His Leu 
65 70 75 80 

Asp Gly He Arg Val Asp Ala Val Ser Asn Met Leu Tyr Leu Asp Tyr 
85 90 95 

Asp Asp Ala Pro Trp Thr Pro Asn Lys Asp Gly Gly Asn Leu Asn Tyr 
100 105 110 

Glu Gly Tyr Tyr Phe Leu Gin Arg Leu Asn Glu Val He Lys Leu Glu 
115 120 125 

Tyr Pro Asp Val Met Met He Ala Glu Glu Ser Ser Ser Ala He Lys 
130 135 140 

He Thr Gly Met Lys Glu He Gly Gly Leu Gly Phe Asp Tyr Lys Trp 
145 150 155 160 

Asn Met Gly Trp Met Asn Asp He Leu Arg Phe Tyr Glu Glu Asp Pro 
165 170 175 

He Tyr Arg Lys Tyr Asp Phe Asn Leu Val Thr Phe Ser Phe Met Tyr 
180 185 190 

Val Xaa Lys Glu Asn Tyr Leu Leu Pro Phe Ser His Asp Glu Val Val 
195 200 205 

His Gly Lys Lys Ser Met Met His Lys Met Trp Gly Asp Arg Tyr Asn 
210 215 220 

Gin Phe Ala Gly Leu Arg Asn Leu Tyr Thr Tyr Gin He Cys His Pro 
225 230 235 _ 240 

Gly Lys Lys Leu Leu Phe Met Gly Ser Glu Tyr Gly Gin Phe Leu Glu 
245 250 255 

Trp Lys Ser Glu Glu Gin Leu Glu Trp Ser Asn Leu Glu Asp Pro Met 
260 265 270 

Asn Ala Lys Met Lys Tyr Phe Ala Ser Gin Leu Asn Gin Phe Tyr Lys 
275 280 285 

Asp His Arg Cys Leu Trp Glu He Asp Thr Ser Tyr Asp Gly He Glu 
290 295 300 

He He Asp Ala Asp Asn Arg Asp Gin Ser Val Leu Ser Phe He Arg 
305 310 315 320 

Lys Gly Lys Lys Gly 
325 

INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2134 base pairs 
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(B) TYPE: nucleic acid 

(C) STRAND EDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 



ATCTGTAGTT 


TATGCGGATG 


TV TV T\ Ti mm TV m 

AAACACTTAT 


m TV /"I m/^ TV m tv m 

TACTCATACT 


/^/^m/^ TV /"< TV TV TV 

GC TGAGAAAC 


/*^m AAA /~1 A t^f^ A 

C T AAAG AGGA 


60 


AAAAATGATA 


GTAGAAGAAA 


AGGCTGATAA 


AGC TTTGG AA 


TV m TV TV TV TV TV m TV 

AC T AAAAAT A 


m TV ^^mm^^ tv tv tv 

TAGTTGAAAG 


120 


GACAGAACAA 


AGTGAACCTA 


GTTC AAC TGA 


GGCTATTGCA 


m/^m/^ TV /^ivTTv /~i TV 

TC TG AGNAG A 


TV TV TV TV TV m/^ TV 

AAGAAGATGA 


180 


AGCCGTAACT 


CCAAAAGAGG 


TV TV TV TV T^ ✓^fTl^^m/^ 

AAAAAGTGTC 


m/^/^mTV TV TV 

TGCTAAACCG 


/"< TV TV "A TV TV TV TV 

GAAGAAAAAG 


C TC C AAGGAT 


z40 


AGAATCACAA 


GCTTCAAATC 


AAGAAAAACC 


/~»/^m/~' TV TV /~^f~y TV TV 

GCTCAAGGAA 


GATGCTAAAG 


mo m A A A A A 

C TGT AAC AAA 


"3 n A 


TGAAGAAGTG 


AATCAAATGA 


TTGAAGACAG 


TV TV TV /^m/^/^ TV m 

GAAAGTGGAT 


mmmTv Am/^A a a 

TTTAATCAAA 


ATTGGT AC TT 


o f n 


TAAACTCAAT 


GCAAATTCTA 


TV f^r^ TV TV f^r^r^ tv m 

AGG AAGC CAT 


m TV TV TV /^/^ A m 

TAAACC TGAT 


glacjAlcj ta r 


L T AC (jTGCj AA 


j1 O A 


AAAATTAGAT 


TTACCGTATG 


TV /^m/~»/^ TV y^m 7v m 

AC TGGAGT AT 


/^mmrmv tv /^/^tv m 

C TTTAAC GAT 


ttcgatcatg 


AA TL TC C TCjC 


^ O A 


ACAAAATGAA 


GGTGGACAGC 


m/^ TV TV m /~i 

TCAACGGTGG 


GGAAGCTTGG 


m A m/~<*^/^ A A A 

TATC GC AAGA 


C TTTC AAAC T 


C /I A 

d40 


AGATGAAAAA 


GACCTCAAGA 


Ti TV TV TV frt^*irnm/^/^ 

AAAATGTTCG 


CCTTACTTTT 


GATGGC GTC T 


A A m/^/^ A 1 111 1 yf* 

ACATGGATTC 


c f\r\ 


TCAAGTTTAT 


GTCAATGGTC 


AGTTAGTGGG 


TV mm TV m/^ tv 

GCATTATCCA 


TV TV m/^/^mrmv m tv 

AATGGTTATA 


A /^/~iA /^rnrny~trn/^ 

ACCAGTTCTC 


c C f\ 


ATATGATATC 


ACCAAATACC 


TTCAAAAAGA 


TGGTCGTGAG 


TV m/^m/^ TV fT^m^^ 

AATGTGATTG 


/^m/^m/^/^Tv m/^/^ 

CTGTCCATGC 


720 


AGTCAACAAA 


CAGCCAAGTA 


GCCGTTGGTA 


mrT>/^ TV ^^^^ TV TV /*im 

TTCAGGAAGT 


/^/^mTV m/^mTV m/^ 

GGTATCTATC 


/^m^^ TV m/^m/^TV 

GTGATGTGAC 


T O A 

780 


TTTACAAGTG 


ACAGATAAGG 


TGCATGTTGA 


TV TV TV TV TV m/^^*^/^ 

GAAAAATGGG 


TV TV TV /^m TV mmm 

ACAACTATTT 


m TV TV TV TV TV TV 

TAACACCAAA 


O /I A 

840 


ACTTGAAGAA 


CAACAACATG 


GCAAGGTTGA 


AACTCATGTG 


ACCAGCAAAA 


TCGTCAATAC 


900 


GGACGACAAA 


GACCATGAAC 


TTGTAGCCGA 


ATATCAAATC 


GTTGAACGAG 


GTGGTCATGC 


960 


TGTAACAGGC 


TTAGTTCGTA 


CAGCGAGTCG 


TACCTTAAAA 


GCACATGAAT 


CAACAAGCCT 


1020 


AGATGCGATT 


TTAGAAGTTG 


TV TV TV TV TV TV TV 

AAAG AC C AAA 


TV m m TV 

ACTCTGGACT 


GTTTTAAATG 


TV TV TV TV 

AC AAAC C TGC 


1 A O A 

lOoO 


CTTGTACGAA 


TTGATTACGC 


GTGTTTACCG 


TGACGGTCAA 


TTGGTTGATG 


m A A A A A 

C T AAG AAGG A 


T T /I A 
1140 


TTTGTTTGGT 


TACCGTTACT 


TV m/^ TV /^m/~i/^ TV 

ATCACTGGAC 


m/^/^TV TV TV m/^ TV TV 

TCCAAATGAA 


GGTTTCTCTT 


TGAATGGTGA 


1 O A A 

IzUU 


ACGTATTAAA 


TTCCATGGAG 


TATCCTTGCA 


CCACGACCAT 


GGGGCGCTTG 


GAGCAGAAGA 


1260 


AAACTATAAA 


GCAGAATATC 


GCCGTCTCAA 


ACAAATGAAG 


GAGATGGGAG 


TTAACTCCAT 


1320 


CCGTACAACC 


CACAACCCTG 


CTAGTGAGCA 


AACCTTGCAA 


ATCGCAGCAG 


AACTAGGTTT 


1380 


ACTCGTTCAG 


GAAGAGGCCT 


TTGATACGTG 


GTATGGTGGC 


AAGAAACCTT 


ATGACTATGG 


1440 


ACGTTTCTTT 


GAAAAAGATG 


CCACTCACCC 


AGAAGCTCGA 


AAAGGTGAAA 


AATGGTCTGA 


1500 


TTTTGACCTA 


CGTACCATGG 


TCGAAAGAGG 


CAAAAACAAC 


CCTGCTATCT 


TCATGTGGTC 


1560 


AATTGGTAAT 


GAAATAGGTG 


AAGCTAATGG 


TGATGCCCAC 


TCTTTAGCAA 


CTGTTAAACG 


1620 


TTTGGTTAAG 


GTTATCAAGG 


ATGTTGATAA 


GACTCGCTAT 


GTTACCATGG 


GAGCAGATAA 


1680 
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ATTCCGTTTC GGTAATGGTA GCGGAGGGCA TGAGAAAATT GCTGATGAAC TCGATGGTGT 1740 

TGGATTTAAC TATTCTGAAG ATAATTACAA AGCCCTTAGA GCTAAGCATC CAAAATGGTT 1800 

GATTTATGGA TCAGAAACAT CTTCAGCTAC CCGTACACGT GGAAGTTACT ATCGCCCTGA 1860 

ACGTGAATTG AAACATAGCA ATGGACCTGA GCGTAATTAT GAACAGTCAG ATTATGGAAA 1920 

TGATCGTGTG GGTTGGGGGA AAACAGCAAC CGCTTCATGG ACTTTTGACC GTGACAACGC 1980 

TGGCTATGCT GGACAGTTTA TCTGGACAGG TACGGAGTAT ATTGGTGAAC CTACACCATG 2040 

GCACAACCAA AATCAAACTC CTGTTAAGAG CTCTTACTTT GGTATCGTAG ATACAGCCGG 2100 

CATTCCAAAA CATGACTTCT ATCTCTACCA AAGC 2134 
(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 711 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:82: 

Ser Val Val Tyr Ala Asp Glu Thr Leu He Thr His Thr Ala Glu Lys 
15 10 15 

Pro Lys Glu Glu Lys Met He Val Glu Glu Lys Ala Asp Lys Ala Leu 
20 25 30 

Glu Thr Lys Asn He Val Glu Arg Thr Glu Gin Ser Glu Pro Ser Ser 
35 40 45 

Thr Glu Ala He Ala Ser Glu Xaa Lys Glu Asp Glu Ala Val Thr Pro 
50 55 60 

Lys Glu Glu Lys Val Ser Ala Lys Pro Glu Glu Lys Ala Pro Arg He 
65 70 75 80 

Glu Ser Gin Ala Ser Asn Gin Glu Lys Pro Leu Lys Glu Asp Ala Lys 
85 90 95 

Ala Val Thr Asn Glu Glu Val Asn Gin Met He Glu Asp Arg Lys Val 
100 105 110 

Asp Phe Asn Gin Asn Trp Tyr Phe Lys Leu Asn Ala Asn Ser Lys Glu 
115 120 125 

Ala He Lys Pro Asp Ala Asp Val Ser Thr Trp Lys Lys Leu Asp Leu 
130 135 140 

Pro Tyr Asp Trp Ser He Phe Asn Asp Phe Asp His Glu Ser Pro Ala 
145 150 155 160 

Gin Asn Glu Gly Gly Gin Leu Asn Gly Gly Glu Ala Trp Tyr Arg Lys 
165 170 175 

Thr Phe Lys Leu Asp Glu Lys Asp Leu Lys Lys Asn Val Arg Leu Thr 
180 185 190 



86 



Phe Asp Gly Val Tyr Met Asp Ser Gin Val Tyr Val Asn Gly Gin Leu 
195 200 205 

Val Gly His Tyr Pro Asn Gly Tyr Asn Glri Phe Ser- Tyr Asp lie Thr 
210 215 220 

Lys Tyr Leu Gin Lys Asp Gly Arg Glu Asn Val lie Ala Val His Ala 
225 230 235 240 

Val Asn Lys Gin Pro Ser Ser Arg Trp Tyr Ser Gly Ser Gly lie Tyr 
245 250 255 

Arg Asp Val Thr Leu Gin Val Thr Asp Lys Val His Val Glu Lys Asn 
260 265 270 

Gly Thr Thr lie Leu Thr Pro Lys Leu Glu Glu Gin Gin His Gly Lys 
275 280 285 

Val Glu Thr His Val Thr Ser Lys lie Val Asn Thr Asp Asp Lys Asp 
290 295 300 

His Glu Leu Val Ala Glu Tyr Gin lie Val Glu Arg Gly Gly His Ala 
305 310 315 320 

Val Thr Gly Leu Val Arg Thr Ala Ser Arg Thr Leu Lys Ala His Glu 
325 330 335 

Ser Thr Ser Leu Asp Ala lie Leu Glu Val Glu Arg Pro Lys Leu Trp 
340 345 350 

Thr Val Leu Asn Asp Lys Pro Ala Leu Tyr Glu Leu lie Thr Arg Val 
355 360 365 

Tyr Arg Asp Gly Gin Leu Val Asp Ala Lys Lys Asp Leu Phe Gly Tyr 
370 375 380 

Arg Tyr Tyr His Trp Thr Pro Asn Glu Gly Phe Ser Leu Asn Gly Glu 
385 390 395 400 

Arg lie Lys Phe His Gly Val Ser Leu His His Asp His Gly Ala Leu 
405 410 415 

Gly Ala Glu Glu Asn Tyr Lys Ala Glu Tyr Arg Arg Leu Lys Gin Met 
420 425 430 

Lys Glu Met Gly Val Asn Ser lie Arg Thr Thr His Asn Pro Ala Ser 
435 440 445 

Glu Gin Thr Leu Gin lie Ala Ala Glu Leu Gly Leu Leu Val Gin Glu 
450 455 460 

Glu Ala Phe Asp Thr Trp Tyr Gly Gly Lys Lys Pro Tyr Asp Tyr Gly 
465 470 475 480 

Arg Phe Phe Glu Lys Asp Ala Thr His Pro Glu Ala Arg Lys Gly Glu 
485 490 495 

Lys Trp Ser Asp Phe Asp Leu Arg Thr Met Val Glu Arg Gly Lys Asn 
500 505 510 

Asn Pro Ala lie Phe Met Trp Ser lie Gly Asn Glu lie Gly Glu Ala 
515 520 525 
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Asn Gly Asp Ala His Ser Leu Ala Thr Val Lys Arg Leu Val Lys Val 
530 535 540 

lie Lys Asp Val Asp Lys Thr Arg Tyr Val Thr Met Gly Ala Asp Lys 
545 550 555 560 

Phe Arg Phe Gly Asn Gly Ser Gly Gly His Glu Lys lie Ala Asp Glu 
565 570 575 

Leu Asp Ala Val Gly Phe Asn Tyr Ser Glu Asp Asn Tyr Lys Ala Leu 
580 585 590 

Arg Ala Lys His Pro Lys Trp Leu lie Tyr Gly Ser Glu Thr Ser Ser 
595 600 605 

Ala Thr Arg Thr Arg Gly Ser Tyr Tyr Arg Pro Glu Arg Glu Leu Lys 
610 615 620 

His Ser Asn Gly Pro Glu Arg Asn Tyr Glu Gin Ser Asp Tyr Gly Asn 
625 630 635 640 

Asp Arg Val Gly Trp Gly Lys Thr Ala Thr Ala Ser Trp Thr Phe Asp 
645 650 655 

Arg Asp Asn Ala Gly Tyr Ala Gly Gin Phe lie Trp Thr Gly Thr Asp 
660 665 670 

Tyr lie Gly Glu Pro Thr Pro Trp His Asn Gin Asn Gin Thr Pro Val 
675 680 685 

Lys Ser Ser Tyr Phe Gly lie Val Asp Thr Ala Gly lie Pro Lys His 
690 695 700 

Asp Phe Tyr Leu Tyr Gin Ser 
705 710 

(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2167 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

TTACTTTGGT ATCGTAGATA CAGCCGGCAT TCCAAAACAT GACTTCTATC TCTACCAAAG 60 

CCAATGGGTT TCTGTTAAGA AGAAACCGAT GGTACACCTT CTTCCTCACT GGAACTGGGA 120 

AAACAAAGAA TTAGCATCCA AAGTAGCTGA CTCAGAAGGT AAGATTCCAG TTCGTGCTTA 180 

TTCGAATGCT TCTAGTGTAG AATTGTTCTT GAATGGAAAA TCTCTTGGTC TTAAGACTTT 240 

CAATAAAAAA CAAACCAGCG ATGGGCGGAC TTACCAAGAA GGTGCAAATG CTAATGAACT 300 

TTATCTTGAA TGGAAAGTTG CCTATCAACC AGGTACCTTG GAAGCAATTG CTCGTGATGA 360 

ATCTGGCAAG GAAATTGCTC GAGATAAGAT TACGACTGCT GGTAAGCCAG CGGCAGTTCG 420 

TCTTATTAAG GAAGACCATG CGATTGCAGC AGATGGAAAA GACTTGACTT ACATCTACTA 480 
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TGAAATTGTT 


GAL AG L L AGG 


GGAA 1 G 1 GG i 


TT'^ A A r*fT^r*r*T^ 
i UGAAL. 1 (jL. i 


AA i AA i L. 1 vjtj 






ATTGCATGGC 


^ TV TV O/^rn/^ TV TV O 

L AAGG i L AAL 


rp 0 m 0 0 rp rp 




/^A A/^AA/^0(^A 

AAU AAtjU U A 


VjL- i AAL. 


D u u 


CTATAAGGCG 


CAAGCAGATG 


y^ mm y^ mrT^ tv rri 

GTTL i IGGA i 


Tl/^/^TlA A A/^/^A 


rprnrpA A rp/*^/*^rpA 

i i i AAi Cjttjji A 


AALtLi i Li i i LrL 


b D U 


CATTGTCAAA 


TCAACTGAAC 


TV TV/^/^TV/^OOTV A 

AAGG AGGCjAA 


AX ILAULC ILf 


AL 1 (jtL-ULAL. i 


L i Lr A i L i L 1 i 


•ion 


TV TV TV TV TV 

GAAATCGAAC 


CAAGTCACTG 


TLTri AG iGG 


rpA A/^AA A/^A A 

i AAU AAAVjj AA 


GkjAL AAO ALtA 


A A /^rp/^rnrprprp 

ALr AL i Lf i i 1 i 


/ 0 U 


tv TV TV TV 

GGGGACAGAA 


/*im/^/~1/^ TV TV TV TV /~* 

GTGCCAAAAG 


TACAGACCAT 


1 A i i GOAGAG 


GGAGG i (jAAA 


ILLL iALLAL 


Q /t n 


TGTTCCGTTT 


m TV m TV TV /^m/*t 

GTATACAGTG 


TV m^^m TV /"'/^OO 

ATGGT AGC C G 


TGCAGAACGT 


^^rporpA A/~'Om 

G G TGT AAG G l 


GGlLTiLAGl 


Q r\ r* 


AGATGTGAGC 


TV TV yi m/^ y^ m tv 

AAGCCTGGTA 


TTGTAACGGT 


^ TV A AO^rPAm/^ 

GAAAGGTATG 


GCTGACGGAC 


0 A A A /"TP A 0 A 

GAGAAGTAGA 


0 c n 


AGCTCGTGTA 


GAAGTGATTG 


/**im/*^mmTV tv tv 

CTCTTAAATC 


TV TV /^/^m TV /^/^ TV 

AGAGCTACCA 


/^rpm/^rp/^ tv tv tv 

GTTGTGAAAC 


GTATTGCTCC 


1 r\ 0 n 
1U2 U 


AAATACTGAC 


TTGAATTCTG 


m TV TV TV TV TV m/** 

TAGACAAATC 


TGTTTCCTAT 


GTTTTGATTG 


A TGGAAG 1 G 1 


1 n 0 
lUo U 


TGAAGAGTAT 


GAAGTGGACA 


TV /^m^^ TV TV m 

AGTGGGAGAT 


IT1/^/^/^/~< TV TV TV TV 

TGCCGAAGAA 


TV rPTV TV TV /^/^m TV 

G ATAAAGG T A 


■R /^mmTv z^/"* TV A m 

AGTTAGLAAl 


i 1 ^ n 
114U 


TCCAGGTTCT 


CGTATTCAAG 


GGAGLGGi 1 A 


rprprn A A A/^Orp 

i 1 lAGAAGGi 


OA A A A rprn/^ 

GAAGGAAi iG 


A i GG AALLL i 


1 0 n r\ 
Iz U U 


TGTGGTAGAA 


TV TV TV TV 

GAAGGCAATC 


CTGCGGCACC 


A ^rp A A 

TGGAGl AGGA 


AG IGiAAGGG 


rprpo^mo/^rpO A 

i iGGiGGiGA 


1 0 c r\ 


TV m TV TV TV 

GGCAGTAACA 


GGTCTTACTA 


/^m/^ TV TV TV TV TV /^/^ 

GTC AAAAAG G 


A A rp/^ ^ A A rp A 

AATGGAATAG 


GGGAGrC 1 IG 


G i iAi GGAGG 


1 "3 0 A 


m TV TV ^^mm/^/^/^TV 

TAAGTTGCCA 


/"» TV TV i^m/*t TV TV /~t 

GAAGTCACAG 


/-» TV 7\ rn/~i/~^ rn 7\ tv 

CAAGTGCTAA 


AAATGCAGCT 


/~i mm 7\ /~i 7\ omm/^ 
GTTAG AG TIG 


rnmo A A f^r* A A 

T. 1 G AAGG AAG 


1 "3 Q n 


m /-^ TV TV TV 

CGCAGCAAAC 


GGCATGCGTG 


CGAGCATCTT 


TATTGAGGG 1 


A A A A TlOOrpO 

AAAGAlGGiG 


GLGG iG 1 iLA 


1 /I /I n 


AACCTATGCA 


TV mm/^ TV TV rr^fTn/^/^ 

ATTCAATTCC 


mm/^ TV 7v /~» 7\ TV 

TTGAAGAAGC 


^/^OA AA A A rprp 

GCCAAAAATT 


GCTCACTTGA 


mm/~^ A 7\ 

GLTTGLAAG 1 


1 R n n 


y***^^TV TV TV TV TV y^ >^ m 

GGAAAAAGCT 


GACAGTCTCA 


TV TV TV TV TV TV 

AAG AAG AC C A 


TV TV /*im/^rpo AAA 

AACTGTCAAA 


TTGTCGGTTC 


/~i 7S f^f^mr^ 7\ /~irn A 

GAGCTCACTA 


1 c c n 


TCAAGATGGA 


TV ^^y^^**<TV TV /^^^m/^ 

ACGCAAGCTG 


m TV mm tv 7v ^1 

T ATT AC C AGC 


mf^ TV m TV TV TV r^m tv 

TGATAAAGTA 


TV /~»/^rnrn/^rn/^rn TV 

ACCTTCTCTA 


0 A A om/^orpo A 

CAAGTGGTGA 


1 cr 0 A 


AGGGGAAGTC 


GCAATTCGTA 


AAGGAATGCT 


m/**i TV /*irt*im/^ /*i tv m 

TGAGTTGCAT 


TV TV y^ y^ TV TV 

AAGCCAGGAG 


TV m^^ TV m 

C AGTC AC TC T 


1 c 0 r\ 


GAACGCTGAA 


TATGAGGGAG 


C TAAAGAC C A 


TV ^^mm/*i TV TV /^fn/^ 

AGTTGAACTC 


TV y^ m TV m/^ tv tv /~i 

ACTATCCAAG 


f^f* TV TV rPTv /^m/~i TV 

CC AATAC TGA 


1 /4U 


GAAGAAGATT 


GCGCAATCCA 


m y^ y-^ m y~t my*l m 

TCCGTCCTGT 


TV TV TV /Tl/^mTV /^fTl^^ 

AAATGTAGTG 


TV TV TV mmm^^/^ 

ACAGATTTGC 


TV m/~^ TV /"I /"I TV TV /"^ 

ATC AGG AAC C 


1 0 n A 

1800 


AAGTCTTCCA 


y^ y^ TV TV TV ✓'trriTv tv 

GCAACAGTAA 


TV mm/^ TV m tv 

CAGTTGAGTA 


nr>/*^ TV TV TV TV /^/^m 

TGACAAAGGT 


mm/^/^/^rp TV tv tv tv 

TTCCCTAAAA 


/"rn/^Tv m TV tv a /^m 

G TL AT AAAGT 


lobO 


CACTTGGCAA 


GCTATTCCGA 


TV TV TV TV TV TV TV /^m 

AAGAAAAACT 


TV TV /~trp/~i/^m TV m 

AGACTCCTAT 


TV TV TV TV mmm/^ 

GAAAGATTTG 


TV TV /*irn A /^rn A f^f^ 

AAG TAG 1 AGG 


1 cn A 

iyz 0 


m TV TV TV /*tmf^/^ TV TV 

TAAAGTTGAA 


GGAATTG AC C 


TTGAAGGGGG 


rp^O A A A A /^rpO 

I GG AAAAG i G 


rporp/^rpA O A A O 

1 G 1 G i AGAAG 


Gi AiGGi i IG 


T Q Q A 


AGTTGAAGAA 


GTCAGTGTGA 


CAACTCCAAT 


CGCAGAAGCA 


CCACAATTAC 


CAGAAAGTGT 


2040 


TCGGACATAT 


GATTCAAATG 


GTCACGTTTC 


ATCAGCTAAG 


GTTGCATGGG 


ATGCGATTCG 


2100 


TCCAGAGCAA 


TACGCTAAGG 


AAGGTGTCTT 


TACAGTTAAT 


GGTCGCTTAG 


AAGGTACGCA 


2160 


ATTAACA 
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(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 722 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 



89 



(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

Tyr Phe Gly lie Val Asp Thr Ala Gly lie Pro Lys His Asp Phe Tyr 
15 10 15 

Leu Tyr Gin Ser Gin Trp Val Ser Val Lys Lys Lys Pro Met Val His 
20 25 30 

Leu Leu Pro His Trp Asn Trp Glu Asn Lys Glu Leu Ala Ser Lys Val 
35 40 45 

Ala Asp Ser Glu Gly Lys lie Pro Val Arg Ala Tyr Ser Asn Ala Ser 
50 55 60 

Ser Val Glu Leu Phe Leu Asn Gly Lys Ser Leu Gly Leu Lys Thr Phe 
65 70 75 80 

Asn Lys Lys Gin Thr Ser Asp Gly Arg Thr Tyr Gin Glu Gly Ala Asn 
85 90 95 

Ala Asn Glu Leu Tyr Leu Glu Trp Lys Val Ala Tyr Gin Pro Gly Thr 
100 105 110 

Leu Glu Ala lie Ala Arg Asp Glu Ser Gly Lys Glu lie Ala Arg Asp 
115 120 125 

Lys lie Thr Thr Ala Gly Lys Pro Ala Ala Val Arg Leu lie Lys Glu 
130 135 140 

Asp His Ala lie Ala Ala Asp Gly Lys Asp Leu Thr Tyr lie Tyr Tyr 
145 150 155 160 

Glu lie Val Asp Ser Gin Gly Asn Val Val Pro Thr Ala Asn Asn Leu 
165 170 175 

Val Arg Phe Gin Leu His Gly Gin Gly Gin Leu Val Gly Val Asp Asn 
180 185 190 

Gly Glu Gin Ala Ser Arg Glu Arg Tyr Lys Ala Gin Ala Asp Gly Ser 
195 200 205 

Trp lie Arg Lys Ala Phe Asn Gly Lys Gly Val Ala lie Val Lys Ser 
210 215 220 

Thr Glu Gin Ala Gly Lys Phe Thr Leu Thr Ala His Ser Asp Leu Leu 
225 230 235 240 

Lys Ser Asn Gin Val Thr Val Phe Thr Gly Lys Lys Glu Gly Gin Glu 
245 250 255 

Lys Thr Val Leu Gly Thr Glu Val Pro Lys Val Gin Thr lie lie Gly 
260 265 270 

Glu Ala Pro Glu Met Pro Thr Thr Val Pro Phe Val Tyr Ser Asp Gly 
275 280 285 

Ser Arg Ala Glu Arg Pro Val Thr Trp Ser Ser Val Asp Val Ser Lys 
290 295 300 
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Pro Gly lie Val Thr Val Lys Gly Met Ala Asp Gly Arg Glu Val Glu 
305 310 315 320 

Ala Arg Val Glu Val lie Ala Leu Lys Ser Glu Leu Pro Val Val Lys 
325 330 335 

Arg lie Ala Pro Asn Thr Asp Leu Asn Ser Val Asp Lys Ser Val Ser 
340 345 350 

Tyr Val Leu lie Asp Gly Ser Val Glu Glu Tyr Glu Val Asp Lys Trp 
355 360 365 

Glu lie Ala Glu Glu Asp Lys Ala Lys Leu Ala He Pro Gly Ser Arg 
370 375 380 

He Gin Ala Thr Gly Tyr Leu Glu Gly Gin Pro He His Ala Thr Leu 
385 390 395 400 

Val Val Glu Glu Gly Asn Pro Ala Ala Pro Ala Val Pro Thr Val Thr 
405 410 415 

Val Gly Gly Glu Ala Val Thr Gly Leu Thr Ser Gin Lys Pro Met Gin 
420 425 430 

Tyr Arg Thr Leu Ala Tyr Gly Ala Lys Leu Pro Glu Val Thr Ala Ser 
435 440 445 

Ala Lys Asn Ala Ala Val Thr Val Leu Gin Ala Ser Ala Ala Asn Gly 
450' 455 460 

Met Arg Ala Ser He Phe He Gin Pro Lys Asp Gly Gly Pro Leu Gin 
465 470 475 480 

Thr Tyr Ala He Gin Phe Leu Glu Glu Ala Pro Lys He Ala His Leu 
485 490 495 

Ser Leu Gin Val Glu Lys Ala Asp Ser Leu Lys Glu Asp Gin Thr Val 
500 505 510 

Lys Leu Ser Val Arg Ala His Tyr Gin Asp Gly Thr Gin Ala Val Leu 
515 520 525 

Pro Ala Asp Lys Val Thr Phe Ser Thr Ser Gly Glu Gly Glu Val Ala 
530 535 540 

He Arg Lys Gly Met Leu Glu Leu His Lys Pro Gly Ala Val Thr Leu 
545 550 555 560 

Asn Ala Glu Tyr Glu Gly Ala Lys Asp Gin Val Glu Leu Thr He Gin 
565 570 575 

Ala Asn Thr Glu Lys Lys He Ala Gin Ser He Arg Pro Val Asn Val 
580 585 590 

Val Thr Asp Leu His Gin Glu Pro Ser Leu Pro Ala Thr Val Thr Val 
595 600 605 

Glu Tyr Asp Lys Gly Phe Pro Lys Thr His Lys Val Thr Trp Gin Ala 
610 615 620 

He Pro Lys Glu Lys Leu Asp Ser Tyr Gin Thr Phe Glu Val Leu Gly 
625 630 635 640 

Lys Val Glu Gly He Asp Leu Glu Ala Arg Ala Lys Val Ser Val Glu 
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645 650 655 

Gly He Val Ser Val Glu Glu Val Ser Val Thr Thr Pro He Ala Glu 
660 665 670 

Ala Pro Gin Leu Pro Glu Ser Val Arg Thr Tyr Asp Ser Asn Gly His 
675 680 685 

Val Ser Ser Ala Lys Val Ala Trp Asp Ala He Arg Pro Glu Gin Tyr 
690 695 700 

Ala Lys Glu Gly Val Phe Thr Val Asn Gly Arg Leu Glu Gly Thr Gin 
705 710 715 720 

Leu Thr 



(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2329 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 



AGCTAAGGTT 


GCATGGGATG 


CGATTCGTCC 


AGAGCAATAC 


GCTAAGGAAG 


GTGTCTTTAC 


60 


AGTTAATGGT 


CGCTTAGAAG 


GTACGCAATT 


AACAACTAAA 


CTTCATGTTC 


GCGTATCTGC 


120 


TCAAACTGAG 


CAAGGTGCAA 


ACATTTCTGA 


CCAATGGACC 


GGTTCAGAAT 


TGCCACTTGC 


180 


CTTTGCTTCA 


GACTCAAATC 


CAAGCGACCC 


AGTTTCAAAT 


GTTAATGACA 


AGCTCATTTC 


240 


CTACAATAAC 


CAACCAGCCA 


ATCGTTGGAC 


AAACTGGAAT 


CGTACTAATC 


CAGAAGCTTC 


300 


AGTCGGTGTT 


CTGTTTGGAG 


ATTCAGGTAT 


CTTGAGCAAA 


CGCTCCGTTG 


ATAATCTAAG 


360 


TGTCGGATTC 


CATGAAGACC 


ATGGAGTTGG 


TGTACCGAAG 


TCTTATGTGA 


TTGAGTATTA 


420 


TGTTGGTAAG 


ACTGTCCCAA 


CAGCTCCTAA 


AAACCCTAGT 


TTTGTTGGTA 


ATGAGGACCA 


480 


TGTCTTTAAT 


GATTCTGCCA 


ACTGGAAACC 


AGTTACTAAT 


CTAAAAGCCC 


CTGCTCAACT 


540 


CAAGGCTGGA 


GAAATGAACC 


ACTTTAGCTT 


TGATAAAGTT 


GAAACCTATG 


CTGTTCGTAT 


600 


TCGCATGGTT 


AAAGCAGATA 


ACAAGCGTGG 


AACGTCTATC 


ACAGAGGTAC 


AAATCTTTGC 


660 


GAAACAAGTT 


GCGGCAGCCA 


AGCAAGGACA 


AACAAGAATC 


CAAGTTGACG 


GCAAAGACTT 


720 


AGCAAACTTC 


AACCCTGATT 


TGACAGACTA 


CTACCTTGAG 


TCTGTAGATG 


GAAAAGTTCC 


780 


GGCAGTCACA 


GCAAGTGTTA 


GCAACAATGG 


TCTCGCTACC 


GTCGTTCCAA 


GCGTTCGTGA 


840 


AGGTGAGCCA 


GTTCGTGTCA 


TCGCGAAAGC 


TGAAAATGGC 


GACATCTTAG 


GAGAATACCG 


900 


TCTGCACTTC 


ACTAAGGATA 


AGAGCTTACT 


TTCTCATAAA 


CCAGTTGCTG 


CGGTTAAACA 


960 


AGCTCGCTTG 


CTACAAGTAG 


GTCAAGCACT 


TGAATTGCCG 


ACTAAGGTTC 


CAGTTTACTT 


1020 


CACAGGTAAA 


GACGGCTACG 


AAACAAAAGA 


CCTGACAGTT 


GAATGGGAAG 


AAGTTCCAGC 


1080 
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GGAAAATCTG 


ACAAAAGCAG 


GTCAATTTAC 


TGTTCGAGGC 


CGTGTCCTTG 


GTAGTAACCT 


1140 


TGTTGCTGAG 


ATCACTGTAC 


GAGTGACAGA 


CAAACTTGGT 


GAGACTCTTT 


CAGATAACCC 


1200 


TAACTATGAT 


GAAAACAGTA 


ACCAGGCCTT 


TGCTTCAGCA 


ACCAATGATA 


TTGACAAAAA 


1260 


CTCTCATGAC 


CGCGTTGACT 


ATCTCAATGA 


CGGAGATCAT 


TCAGAAAATC 


GTCGTTGGAC 


1320 


AAACTGGTCA 


CCAACACCAT 


CTTCTAATCC 


AGAAGTATCA 


GCGGGTGTGA 


TTTTCCGTGA 


1380 


AAATGGTAAG 


ATTGTAGAAC 


GGACTGTTAC 


ACAAGGAAAA 


GTTCAGTTCT 


TTGCAGATAG 


1440 


TGGTACGGAT 


GCACCATCTA 


AACTCGTTTT 


AGAACGCTAT 


GTCGGTCCAG 


AGTTTGAAGT 


1500 


GCCAACCTAC 


TATTCAAACT 


ACCAAGCCTA 


CGACGCAGAC 


CATCCATTCA 


ACAATCCAGA 


1560 


AAATTGGGAA 


GCTGTTCCTT 


ATCGTGCGGA 


TAAAGACATT 


GCAGCTGGTG 


AtGAAATCAA 


1620 


CGTAACATTT 


AAAGCTATCA 


AAGCCAAAGC 


TATGAGATGG 


CGTATGGAGC 


GTAAAGCAGA 


1680 


TAAGAGCGGT 


GTTGCGATGA 


TTGAGATGAC 


CTTCCTTGCA 


CCAAGTGAAT 


TGCCTCAAGA 


1740 


AAGCACTCAA 


TCAAAGATTC 


TTGTAGATGG 


AAAAGAACTT 


GCTGATTTCG 


CTGAAAATCG 


1800 


TCAAGACTAT 


CAAATTACCT 


ATAAAGGTCA 


ACGGCCAAAA 


GTCTCAGTTG 


AAGAAAACAA 


1860 


TCAAGTAGCT 


TCAACTGTGG 


TAGATAGTGG 


AGAAGATAGC 


TTTCCAGTAC 


TTGTTCGCCT 


1920 


CGTTTCAGAA 


AGTGGAAAAC 


AAGTCAAGGA 


ATACCGTATC 


CACTTGACTA 


AGGAAAAACC 


1980 


AGTTTCTGAG 


AAGACAGTTG 


CTGCTGTACA 


AGAAGATCTT 


CCAAAAATCG 


AATTTGTTGA 


2040 


AAAAGATTTG 


GCATACAAGA 


CAGTTGAGAA 


AAAAGATTCA 


ACACTGTATC 


TAGGTGAAAC 


2100 


TCGTGTAGAA 


CAAGAAGGAA 


AAGTTGGAAA 


AGAACGTATC 


TTTACAGCGA 


TTAATCCTGA 


2160 


TGGAAGTAAG 


GAAGAAAAAC 


TCCGTGAAGT 


GGTAGAAGTT 


CCGACAGACC 


GCATCGTCTT 


2220 


GGTTGGAACC 


AAACCAGTAG 


CTCAAGAAGC 


TAAAAAACCA 


CAAGTGTCAG 


AAAAAGCAGA 


2280 


TACAAAACCA 


ATTGATTCAA 


GTGAAGCTAG 


TCAAACTAAT 


AAAGCCCAG 




2329 



(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 776 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 

Ala Lys Val Ala Trp Asp Ala lie Arg Pro Glu Gin Tyr Ala Lys Glu 
15 10 15 

Gly Val Phe Thr Val Asn Gly Arg Leu Glu Gly Thr Gin Leu Thr Thr 
20 25 30 

Lys Leu His Val Arg Val Ser Ala Gin Thr Glu Gin Gly Ala Asn lie 
35 40 45 
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Ser Asp Gin Trp Thr Gly Ser Glu Leu Pro Leu Ala Phe Ala Ser Asp 
50 55 60 

Ser Asn Pro Ser Asp Pro Val Ser Asn Val Asn Asp Lys Leu lie Ser 
65 70 75 80 

Tyr Asn Asn Gin Pro Ala Asn Arg Trp Thr Asn Trp Asn Arg Thr Asn 
85 90 95 

Pro Glu Ala Ser Val Gly Val Leu Phe Gly Asp Ser Gly lie Leu Ser 
100 105 110 

Lys Arg Ser Val Asp Asn Leu Ser Val Gly Phe His Glu Asp His Gly 
115 120 125 

Val Gly Val Pro Lys Ser Tyr Val lie Glu Tyr Tyr Val Gly Lys Thr 
130 135 140 

Val Pro Thr Ala Pro Lys Asn Pro Ser Phe Val Gly Asn Glu Asp His 
145 150 155 160 

Val Phe Asn Asp Ser Ala Asn Trp Lys Pro Val Thr Asn Leu Lys Ala 
165 170 175 

Pro Ala Gin Leu Lys Ala Gly Glu Met Asn His Phe Ser Phe Asp Lys 
180 185 190 

Val Glu Thr Tyr Ala Val Arg lie Arg Met Val Lys Ala Asp Asn Lys 
195 200 205 

Arg Gly Thr Ser lie Thr Glu Val Gin lie Phe Ala Lys Gin Val Ala 
210 215 220 

Ala Ala Lys Gin Gly Gin Thr Arg lie Gin Val Asp Gly Lys Asp Leu 
225 230 235 240 

Ala Asn Phe Asn Pro Asp Leu Thr Asp Tyr Tyr Leu Glu Ser Val Asp 
245 250 255 

Gly Lys Val Pro Ala Val Thr Ala Ser Val Ser Asn Asn Gly Leu Ala 
260 265 270 

Thr Val Val Pro Ser Val Arg Glu Gly Glu Pro Val Arg Val lie Ala 
275 280 285 

Lys Ala Glu Asn Gly Asp lie Leu Gly Glu Tyr Arg Leu His Phe Thr 
290 295 300 

Lys Asp Lys Ser Leu Leu Ser His Lys Pro Val Ala Ala Val Lys Gin 
305 310 315 320 

Ala Arg Leu Leu Gin Val Gly Gin Ala Leu Glu Leu Pro Thr Lys Val 
325 330 335 

Pro Val Tyr Phe Thr Gly Lys Asp Gly Tyr Glu Thr Lys Asp Leu Thr 
340 345 350 

Val Glu Trp Glu Glu Val Pro Ala Glu Asn Leu Thr Lys Ala Gly Gin 
355 360 365 

Phe Thr Val Arg Gly Arg Val Leu Gly Ser Asn Leu Val Ala Glu lie 
370 375 380 
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Thr Val Arg Val Thr Asp Lys Leu Gly Glu Thr Leu Ser Asp Asn Pro 
385 390 395 400 

Asn Tyr Asp Glu Asn Ser Asn Gin Ala Phe Ala Ser Ala Thr Asn Asp 
405 410 415 

lie Asp Lys Asn Ser His Asp Arg Val Asp Tyr Leu Asn Asp Gly Asp 
420 425 430 

His Ser Glu Asn Arg Arg Trp Thr Asn Trp Ser Pro Thr Pro Ser Ser 
435 440 445 

Asn Pro Glu Val Ser Ala Gly Val lie Phe Arg Glu Asn Gly Lys lie 
450 455 460 

Val Glu Arg Thr Val Thr Gin Gly Lys Val Gin Phe Phe Ala Asp Ser 
465 470 475 480 

Gly Thr Asp Ala Pro Ser Lys Leu Val Leu Glu Arg Tyr Val Gly Pro 
485 490 495 

Glu Phe Glu Val Pro Thr Tyr Tyr Ser Asn Tyr Gin Ala Tyr Asp Ala 
500 505 510 

Asp His Pro Phe Asn Asn Pro Glu Asn Trp Glu Ala Val Pro Tyr Arg 
515 520 525 

Ala Asp Lys Asp lie Ala Ala Gly Asp Glu lie Asn Val Thr Phe Lys 
530 535 540 

Ala lie Lys Ala Lys Ala Met Arg Trp Arg Met Glu Arg Lys Ala Asp 
545 550 555 560 

Lys Ser Gly Val Ala Met lie Glu Met Thr Phe Leu Ala Pro Ser Glu 
565 570 575 

Leu Pro Gin Glu Ser Thr Gin Ser Lys lie Leu Val Asp Gly Lys Glu 
580 585 590 

Leu Ala Asp Phe Ala Glu Asn Arg Gin Asp Tyr Gin lie Thr Tyr Lys 
595 600 605 

Gly Gin Arg Pro Lys Val Ser Val Glu Glu Asn Asn Gin Val Ala Ser 
610 615 620 

Thr Val Val Asp Ser Gly Glu Asp Ser Phe Pro Val Leu Val Arg Leu 
625 630 635 640 

Val Ser Glu Ser Gly Lys Gin Val Lys Glu Tyr Arg lie His Leu Thr 
645 650 655 

Lys Glu Lys Pro Val Ser Glu Lys Thr Val Ala Ala Val Gin Glu Asp 
660 665 670 

Leu Pro Lys lie Glu Phe Val Glu Lys Asp Leu Ala Tyr Lys Thr Val 
675 680 685 

Glu Lys Lys Asp Ser Thr Leu Tyr Leu Gly Glu Thr Arg Val Glu Gin 
690 695 700 

Glu Gly Lys Val Gly Lys Glu Arg lie Phe Thr Ala lie Asn Pro Asp 
705 710 715 720 

Gly Ser Lys Glu Glu Lys Leu Arg Glu Val Val Glu Val Pro Thr Asp 
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725 730 735 

Arg lie Val Leu Val Gly Thr Lys Pro Val Ala Gin Glu Ala Lys Lys 
740 745 750 

Pro Gin Val Ser Glu Lys Ala Asp Thr Lys Pro lie Asp Ser Ser Glu 
755 760 765 

Ala Ser Gin Thr Asn Lys Ala Gin 
770 775 

(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 133 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 
CTATCACTAT GTAAATAAAG AGATTATTTC ACAAGAAGCT AAAGATTTAA TTCAGACAGG 60 
AAAGCCTGAC AGGAATGAAG TTGTATATGG TTTGGTGTAT CAAAAAGATC AGTTGCCTCA 120 
AACAGGGACA GAA 133 
(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

Tyr His Tyr Val Asn Lys Glu lie lie Ser Gin Glu Ala Lys Asp Leu 
15 10 15 

lie Gin Thr Gly Lys Pro Asp Arg Asn Glu Val Val Tyr Gly Leu Val 
20 25 30 

Tyr Gin Lys Asp Gin Leu Pro Gin Thr Gly Thr Glu 
35 40 

(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 775 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 
TGAGACTCCT CAATCAATAA CAAATCAGGA GCAAGCTAGG ACAGAAAACC AAGTAGTAGA 



60 
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GACAGAGGAA 


GCTCCAAAAG 


AAGAAGCACC 


TAAAACAGAA 


GAAAGTCCAA 


AGGAAGAACC 


120 


AAAATCGGAG 


GTAAAACCTA 


CTGACGACAC 


CCTTCCTAAA 


GTAGAAGAGG 


GGAAAGAAGA 


180 


TTCAGCAGAA 


CCAGCTCCAG 


TTGAAGAAGT 


AGGTGGAGAA 


GTTGAGTCAA 


AACCAGAGGA 


240 


AAAAGTAGCA 


GTTAAGCCAG 


AAAGTCAACC 


ATCAGACAAA 


CCAGCTGAGG 


AATCAAAAGT 


300 


TGAACAAGCA 


GGTGAACCAG 


TCGCGCCAAG 


AGAAGACGAA 


AAGGCACCAG 


TCGAGCCAGA 


360 


AAAGCAACCA 


GAAGCTCCTG 


AAGAAGAGAA 


GGCTGTAGAG 


GAAACACCGA 


AACAAGAAGA 


420 


GTCAACTCCA 


GATACCAAGG 


CTGAAGAAAC 


TGTAGAACCA 


AAAGAGGAGA 


CTGTTAATCA 


480 


ATCTATTGAA 


CAACCAAAAG 


TTGAAACGCC 


TGCTGTAGAA 


AAACAAACAG 


AACCAACAGA 


540 


GGAACCAAAA 


GTTGAACAAG 


CAGGTGAACC 


AGTCGCGCCA 


AGAGAAGACG 


AACAGGCACC 


600 


AACGGCACCA 


GTTGAGCCAG 


AAAAGCAACC 


AGAAGTTCCT 


GAAGAAGAGA 


AGGCTGTAGA 


660 


GGAAACACCG 


AAACCAGAAG 


ATAAAATAAA 


GGGTATTGGT 


ACTAAAGAAC 


CAGTTGATAA 


720 


AAGTGAGTTA 


AATAATCAAA 


TTGATAAAGC 


TAGTTCAGTT 


TCTCCTACTG 


ATTAT 


775 



(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 258 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

Glu Thr Pro Gin Ser lie Thr Asn Gin Glu Gin Ala Arg Thr Glu Asn 
15 10 15 

Gin Val Val Glu Thr Glu Glu Ala Pro Lys Glu Glu Ala Pro Lys Thr 
20 25 30 

Glu Glu Ser Pro Lys Glu Glu Pro Lys Ser Glu Val Lys Pro Thr Asp 
35 40 45 

Asp Thr Leu Pro Lys Val Glu Glu Gly Lys Glu Asp Ser Ala Glu Pro 
50 55 60 

Ala Pro Val Glu Glu Val Gly Gly Glu Val Glu Ser Lys Pro Glu Glu 
65 70 75 80 

Lys Val Ala Val Lys Pro Glu Ser Gin Pro Ser Asp Lys Pro Ala Glu 
85 90 95 

Glu Ser Lys Val Glu Gin Ala Gly Glu Pro Val Ala Pro Arg Glu Asp 
100 105 110 

Glu Lys Ala Pro Val Glu Pro Glu Lys Gin Pro Glu Ala Pro Glu Glu 
115 120 125 

Glu Lys Ala Val Glu Glu Thr Pro Lys Gin Glu Glu Ser Thr Pro Asp 
130 135 140 
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Thr Lys Ala Glu Glu Thr Val Glu Pro Lys Glu Glu Thr Val Asn Gin 
145 150 155 160 

Ser He Glu Gin Pro Lys Val Glu Thr Pro Ala Val Glu Lys Gin Thr 
165 170 175 

Glu Pro Thr Glu Glu Pro Lys Val Glu Gin Ala Gly Glu Pro Val Ala 
180 185 190 

Pro Arg Glu Asp Glu Gin Ala Pro Thr Ala Pro Val Glu Pro Glu Lys 
195 200 205 

Gin Pro Glu Val Pro Glu Glu Glu Lys Ala Val Glu Glu Thr Pro Lys 
210 215 220 

Pro Glu Asp Lys He Lys Gly He Gly Thr Lys Glu Pro Val Asp Lys 
225 230 235 240 

Ser Glu Leu Asn Asn Gin He Asp Lys Ala Ser Ser Val Ser Pro Thr 
245 250 255 



Asp Tyr 



(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 199 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

GGATGCTCAA GAAACTGCGG GAGTTCACTA TAAATATGTG GCAGATTCAG AGCTATCATC 60 

AGAAGAAAAG AAGCAGCTTG TCTATGATAT TCCGACATAC GTGGAGAATG ATGATGAAAC 120 

TTATTATCTT GTTTATAAGT TAAATTCTCA AAATCAACTG GCGGAATTGC CAAATACTGG 180 

AAGCAAGAAT GAGAGGCAA 199 
(2) INFORMATION FOR SEQ ID NO: 92: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 66 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 

Asp Ala Gin Glu Thr Ala Gly Val His Tyr Lys Tyr Val Ala Asp Ser 
15 10 15 

Glu Leu Ser Ser Glu Glu Lys Lys Gin Leu Val Tyr Asp He Pro Thr 
20 25 30 
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Tyr Val Glu Asn Asp Asp Glu Thr Tyr Tyr Leu Val Tyr Lys Leu Asn 
35 40 45 

Ser Gin Asn Gin Leu Ala Glu Leu Pro Asn Thr Gly Ser Lys Asn Glu 
50 55 60 

Arg Gin 
65 

(2) INFORMATION FOR SEQ ID NO: 93: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 835 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 



CGACAAAGGT 


GAGACTGAGG 


TTCAACCAGA 


GTCGCCAGAT 


ACTGTGGTAA 


GTGATAAAGG 


60 


TGAACCAGAG 


CAGGTAGCAC 


CGCTTCCAGA 


ATATAAGGGT 


AATATTGAGC 


AAGTAAAACC 


120 


TGAAACTCCG 


GTTGAGAAGA 


CCAAAGAACA 


AGGTCCAGAA 


AAAACTGAAG 


AAGTTCCAGT 


180 


AAAACCAACA 


GAAGAAACAC 


CAGTAAATCC 


AAATGAAGGT 


ACTACAGAAG 


GAACCTCAAT 


240 


TCAAGAAGCA 


GAAAATCCAG 


TTCAACCTGC 


AGAAGAATCA 


ACAACGAATT 


CAGAGAAAGT 


300 


ATCACCAGAT 


ACATCTAGCA 


AAAATACTGG 


GGAAGTGTCC 


AGTAATCCTA 


GTGATTCGAC 


360 


AACCTCAGTT 


GGAGAATCAA 


ATAAACCAGA 


ACATAATGAC 


TCTAAAAATG 


AAAATTCAGA 


420 


AAAAACTGTA 


GAAGAAGTTC 


CAGTAAATCC 


AAATGAAGGC 


ACAGTAGAAG 


GTACCTCAAA 


480 


TCAAGAAACA 


GAAAAACCAG 


TTCAACCTGC 


AGAAGAAACA 


CAAACAAACT 


CTGGGAAAAT 


540 


AGCTAACGAA 


AATACTGGAG 


AAGTATCCAA 


TAAACCTAGT 


GATTCAAAAC 


CACCAGTTGA 


600 


AGAATCAAAT 


CAACCAGAAA 


AAAACGGAAC 


TGCAACAAAA 


CCAGAAAATT 


CAGGTAATAC 


660 


AACATCAGAG 


AATGGACAAA 


CAGAACCAGA 


ACCATCAAAC 


GGAAATTCAA 


CTGAGGATGT 


720 


TTCAACCGAA 


TCAAACACAT 


CCAATTCAAA 


TGGAAACGAA 


GAAATTAAAC 


AAGAAAATGA 


780 


ACTAGACCCT 


GATAAAAAGG 


TAGAAGAACC 


AGAGAAAACA 


CTTGAATTAA 


GAAAT 


835 



(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 278 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 

Asp Lys Gly Glu Thr Glu Val Gin Pro Glu Ser Pro Asp Thr Val Val 
1 5 10 . 15 
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Ser Asp Lys Gly Glu Pro Glu Gin Val Ala Pro Leu Pro Glu Tyr Lys 
20 25 30 

Gly Asn lie Glu Gin Val Lys Pro Glu Thr Pro Val Glu Lys Thr Lys 
35 40 45 

Glu Gin Gly Pro Glu Lys Thr Glu Glu Val Pro Val Lys Pro Thr Glu 
50 55 60 

Glu Thr Pro Val Asn Pro Asn Glu Gly Thr Thr Glu Gly Thr Ser lie 
65 70 75 80 

Gin Glu Ala Glu Asn Pro Val Gin Pro Ala Glu Glu Ser Thr Thr Asn 
85 90 95 

Ser Glu Lys Val Ser Pro Asp Thr Ser Ser Lys Asn Thr Gly Glu Val 
100 105 110 

Ser Ser Asn Pro Ser Asp Ser Thr Thr Ser Val Gly Glu Ser Asn Lys 
115 120 125 

Pro Glu His Asn Asp Ser Lys Asn Glu Asn Ser Glu Lys Thr Val Glu 
130 135 140 

Glu Val Pro Val Asn Pro Asn Glu Gly Thr Val Glu Gly Thr Ser Asn 
145 150 155 160 

Gin Glu Thr Glu Lys Pro Val Gin Pro Ala Glu Glu Thr Gin Thr Asn 
165 170 175 

Ser Gly Lys lie Ala Asn Glu Asn Thr Gly Glu Val Ser Asn Lys Pro 
180 185 190 

Ser Asp Ser Lys Pro Pro Val Glu Glu Ser Asn Gin Pro Glu Lys Asn 
195 200 205 

Gly Thr Ala Thr Lys Pro Glu Asn Ser Gly Asn Thr Thr Ser Glu Asn 
210 - 215 220 

Gly Gin Thr Glu Pro Glu Pro Ser Asn Gly Asn Ser Thr Glu Asp Val 
225 230 235 240 

Ser Thr Glu Ser Asn Thr Ser Asn Ser Asn Gly Asn Glu Glu lie Lys 
245 250 255 

Gin Glu Asn Glu Leu Asp Pro Asp Lys Lys Val Glu Glu Pro Glu Lys 
260 265 270 

Thr Leu Glu Leu Arg Asn 
275 

INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 709 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 
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AAATCAATTG GTAGCACAAG ATCCAAAAGC ACAAGATAGC ACTAAACTGA CTGCTGAAAA 60 

ATCAACTGTT AAAGCACCTG CTCAAAGAGT AGATGTAAAA GATATAACTC ATTTAACAGA 120 

TGAAGAAAAA GTTAAGGTTG CTATTTTACA AGCAAATGGT TCAGCATTAG ACGGAGCGAC 180 

AATCAATGTA GCTGGAGATG GTACAGCAAC AATCACATTC CCAGATGGTT CAGTAGTGAC 240 

GATTCTAGGA AAAGATACAG TTCAACAATC TGCGAAAGGT GAATCTGTAA CTCAAGAAGC 300 

TACACCAGAG TATAAGCTAG AAAATACACC AGGTGGAGAT AAGGGAGGCA ATACTGGAAG 360 

CTCAGATGCT AATGCGAATG AAGGCGGTGG TAGCCAGGCG GGTGGATCAG CTCACACAGG 420 

TTCACAAAAC TCAGCTCAAT CACAAGCTTC TAAGCAATTA GCTACTGAAA AAGAATCAGC 480 

TAAAAATGCC ATTGAAAAAG CAGCCAAGGA CAAGCAGGAT GAAATCAAAG GCGCACCGCT 540 

TTCTGATAAA GAAAAAGCAG AACTTTTAGC AAGAGTGGAA GCAGAAAAAC AAGCAGCTCT 600 

CAAAGAGATT GAAAATGCGA AAACTATGGA AGATGTGAAG GAAGCAGAAA CGATTGGAGT 660 

GCAAGCCATT GCCATGGTTA CAGTTCCTAA GAGACCAGTG GCTCCTAAT 709 
(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 236 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii). MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 

Asn Gin Leu Val Ala Gin Asp Pro Lys Ala Gin Asp Ser Thr Lys Leu 
15 10 15 

Thr Ala Glu Lys Ser Thr Val Lys Ala Pro Ala Gin Arg Val Asp Val 
20 25 30 

Lys Asp lie Thr His Leu Thr Asp Glu Glu Lys Val Lys Val Ala lie 
35 40 45 

Leu Gin Ala Asn Gly Ser Ala Leu Asp Gly Ala Thr lie Asn Val Ala 
50 55 60 

Gly Asp Gly Thr Ala Thr lie Thr Phe Pro Asp Gly Ser Val Val Thr 
65 70 75 80 

lie Leu Gly Lys Asp Thr Val Gin Gin Ser Ala Lys Gly Glu Ser Val 
85 90 95 

Thr Gin Glu Ala Thr Pro Glu Tyr Lys Leu Glu Asn Thr Pro Gly Gly 
100 105 110 

Asp Lys Gly Gly Asn Thr Gly Ser Ser Asp Ala Asn Ala Asn Glu Gly 
115 120 125 

Gly Gly Ser Gin Ala Gly Gly Ser Ala His Thr Gly Ser Gin Asn Ser 
130 135 140 
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Ala Gin Ser Gin Ala Ser Lys Gin Leu Ala Thr Glu Lys Glu Ser Ala 
145 150 155 160 

Lys Asn Ala lie Glu Lys Ala Ala Lys Asp Lys Gin Asp Glu lie Lys 
165 170 175 

Gly Ala Pro Leu Ser Asp Lys Glu Lys Ala Glu Leu Leu Ala Arg Val 
180 - 185 190 

Glu Ala Glu Lys Gin Ala Ala Leu Lys Glu lie Glu Asn Ala Lys Thr 
195 200 205 

Met Glu Asp Val Lys Glu Ala Glu Thr lie Gly Val Gin Ala lie Ala 
210 215 220 

Met Val Thr Val Pro Lys Arg Pro Val Ala Pro Asn 
225 230 235 

(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 787 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

CAAACAGTCA GCTTCAGGAA CGATTGAGGT GATTTCACGA GAAAATGGCT CTGGGACACG 60 

GGGTGCCTTC ACAGAAATCA CAGGGATTCT CAAAAAAGAC GGTGATAAAA AAATTGACAA 120 

CACTGCCAAA ACAGCTGTGA TTCAAAATAG TACAGAAGGT GTTCTCTCAG CAGTTCAAGG 180 

GAATGCTAAT GCTATCGGCT ACATCTCCTT GGGATCTTTA ACGAAATCTG TCAAGGCTTT 240 

AGAGATTGAT GGTGTCAAGG CTAGTCGAGA CACAGTTTTA GATGGTGAAT ACCCTCTTCA 300 

ACGTCCCTTC AACATTGTTT GGTCTTCTAA TCTTTCCAAG CTAGGTCAAG ATTTTATCAG 360 

CTTTATCCAC TCCAAACAAG GTCAACAAGT GGTCACAGAT AATAAATTTA TTGAAGCTAA 420 

AACCGAAACC ACGGAATATA CAAGCCAACA CTTATCAGGC AAGTTGTCTG TTGTAGGTTC 480 

CACTTCAGTA TCTTCTTTAA TGGAAAAATT AGCAGAAGCT TATAAAAAAG AAAATCCAGA 540 

AGTTACGATT GATATTACCT CTAATGGGTC TTCAGCAGGT ATTACCGCTG TTAAGGAGAA 600 

AACCGCTGAT ATTGGTATGG TTTCTAGGGA ATTAACTCCT GAAGAAGGTA AGAGTCTCAC 660 

CCATGATGCT ATTGCTTTAG ACGGTATTGC TGTTGTGGTC AATAATGACA ATAAGGCAAG 720 

CCAAGTCAGT ATGGCTGAAC TTGCAGACGT TTTTAGTGGC AAATTAACCA CCTGGGACAA 780 

GATTAAA 787 
(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 262 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 

Lys Gin Ser Ala Ser Gly Thr lie Glu Val lie Ser Arg Glu Asn Gly 
15 10 15 

Ser Gly Thr Arg Gly Ala Phe Thr Glu lie Thr Gly lie Leu Lys Lys 
20 25 30 

Asp Gly Asp Lys Lys lie Asp Asn Thr Ala Lys Thr Ala Val lie Gin 
35 40 45 

Asn Ser Thr Glu Gly Val Leu Ser Ala Val Gin Gly Asn Ala Asn Ala 
50 55 60 

lie Gly Tyr lie Ser Leu Gly Ser Leu Thr Lys Ser Val Lys Ala Leu 
65 70 75 80 

Glu lie Asp Gly Val Lys Ala Ser Arg Asp Thr Val Leu Asp Gly Glu 
85 90 95 

Tyr Pro Leu Gin Arg Pro Phe Asn lie Val Trp Ser Ser Asn Leu Ser 
100 105 110 

Lys Leu Gly Gin Asp Phe lie Ser Phe lie His Ser Lys Gin Gly Gin 
115 120 125 

Gin Val Val Thr Asp Asn Lys Phe lie Glu Ala Lys Thr Glu Thr Thr 
130 135 140 

Glu Tyr Thr Ser Gin His Leu Ser Gly Lys Leu Ser Val Val Gly Ser 
145 150 155 160 

Thr Ser Val Ser Ser Leu Met Glu Lys Leu Ala Glu Ala Tyr Lys Lys 
165 170 175 

Glu Asn Pro Glu Val Thr lie Asp lie Thr Ser Asn Gly Ser Ser Ala 
180 185 190 

Gly lie Thr Ala Val Lys Glu Lys Thr Ala Asp lie Gly Met Val Ser 
195 200 205 

Arg Glu Leu Thr Pro Glu Glu Gly Lys Ser Leu Thr His Asp Ala lie 
210 215 220 

Ala Leu Asp Gly lie Ala Val Val Val Asn Asn Asp Asn Lys Ala Ser 
225 230 235 240 

Gin Val Ser Met Ala Glu Leu Ala Asp Val Phe Ser Gly Lys Leu Thr 
245 250 255 

Thr Trp Asp Lys lie Lys 
260 

INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 421 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 

ATTCGATGAT GCGGATGAAA AGATGACCCG TGATGAAATT GCCTATATGC TGACAAATAG 60 

TGAAGAAACA TTGGATGCTG ATGAGATTGA GATGCTACAA GGTGTCTTTT CGCTCGATGA 120 

ACTGATGGCA CGAGAGGTTA TGGTTCCTCG AACGGATGCC TTTATGGTGG ATATTCAGGA 180 

TGATAGTCAA GCCATTATCC AAAGTATTTT AAAACAAAAT TATTCTCGTA TCCCGGTTTA 240 

TGATGGGGAT AAGGACAATG TAATTGGAAT CATTCACACC AAGAGTCTCC TTAAGGCAGG 3 00 

CTTTGTGGAC GGTTTTGACA ATATTGTTTG GAAGAGAATT TTACAAGATC CACTTTTTGT 360 

ACCTGAAACT ATTTTTGTGG ATGACTTGCT AAAAGAACTG CGAAATACCC AAAGACAAAT 420 

G 421 
(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 140 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 

Phe Asp Asp Ala Asp Glu Lys Met Thr Arg Asp Glu lie Ala Tyr Met 
15 10 15 

Leu Thr Asn Ser Glu Glu Thr Leu Asp Ala Asp Glu lie Glu Met Leu 
20 25 30 

Gin Gly Val Phe Ser Leu Asp Glu Leu Met Ala Arg Glu Val Met Val 
35 40 45 

Pro Arg Thr Asp Ala Phe Met Val Asp lie Gin Asp Asp Ser Gin Ala 
50 55 60 

lie lie Gin Ser lie Leu Lys Gin Asn Tyr Ser Arg lie Pro Val Tyr 
65 70 75 80 

Asp Gly Asp Lys Asp Asn Val lie Gly lie lie His Thr Lys Ser Leu 
85 90 95 

Leu Lys Ala Gly Phe Val Asp Gly Phe Asp Asn lie Val Trp Lys Arg 
100 105 110 

lie Leu Gin Asp Pro Leu Phe Val Pro Glu Thr lie Phe Val Asp Asp 
115 120 125 

Leu Leu Lys Glu Leu Arg Asn Thr Gin Arg Gin Met 
130 135 140 

(2) INFORMATION FOR SEQ ID NO: 101: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 331 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 
GGAGAGTCGA TCAAAAGTAG ATGAAGCTGT GTCTAAGTTT GAAAAGGACT CATCTTCTTC 
GTCAAGTTCA GACTCTTCCA CTAAACCGGA AGCTTCAGAT ACAGCGAAGC CAAACAAGCC 
GACAGAACCA GGAGAAAAGG TAGCAGAAGC TAAGAAGAAG GTTGAAGAAG CTGAGAAAAA 
AGCCAAGGAT CAAAAAGAAG AAGATCGTCG TAACTACCCA ACCATTACTT ACAAAACGCT 
TGAACTTGAA ATTGCTGAGT CCGATGTGGA AGTTAAAAAA GCGGAGCTTG AACTAGTAAA 
AGTGAAAGCT AACGAACCTC GAGACGAGCA A 
(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 110 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 

Glu Ser Arg Ser Lys Val Asp Glu Ala Val Ser Lys Phe Glu Lys Asp 
15 10 15 

Ser Ser Ser Ser Ser Ser Ser Asp Ser Ser Thr Lys Pro Glu Ala Ser 
20 25 30 

Asp Thr Ala Lys Pro Asn Lys Pro Thr Glu Pro Gly Glu Lys Val Ala 
35 40 45 

Glu Ala Lys Lys Lys Val Glu Glu Ala Glu Lys Lys Ala Lys Asp Gin 
50 55 60 

Lys Glu Glu Asp Arg Arg Asn Tyr Pro Thr lie Thr Tyr Lys Thr Leu 
65 70 75 80 

Glu Leu Glu lie Ala Glu Ser Asp Val Glu Val Lys Lys Ala Glu Leu 
85 90 95 

Glu Leu Val Lys Val Lys Ala Asn Glu Pro Arg Asp Glu Gin 
100 105 110 

(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 358 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 

ATGGACAACA GGAAACTGGG ACGAGGTTAT ATCTGGTAAG ATTGACAAGT ACAAAGATCC 60 

AGATATTCCA ACAGTTGAAT CACAAGAAGT TACGTCAGAC TCTAGTGATA AAGAAATAAC 120 

GGTAAGGTAT GACCGTTTAT CAACACCAGA AAAACCAATC CCACAACCAA ATCCAGAGCA 180 

TCCAAGTGTT CCGACACCAA ACCCAGAACT ACCAAATCAA GAGACTCCAA CACCAGATAA 240 

ACCAACTCCA GAACCAGGTA CTCCAAAAAC TGAAACTCCA GTGAATGCAG ACCCAGAAGT 300 

TCCGACTTAT GAGACAGGTA AGAGAGAGGA ATTGCCAAAC ACAGGTACAG AAGCTAAT 358 
(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 

Trp Thr Thr Gly Asn Trp Asp Glu Val lie Ser Gly Lys lie Asp Lys 
15 10 15 

Tyr Lys Asp Pro Asp lie Pro Thr Val Glu Ser Gin Glu Val Thr Ser 
20 25 30 

Asp Ser Ser Asp Lys Glu lie Thr Val Arg Tyr Asp Arg Leu Ser Thr 
35 40 45 

Pro Glu Lys Pro lie Pro Gin Pro Asn Pro Glu His Pro Ser Val Pro 
50 55 60 

Thr Pro Asn Pro Glu Leu Pro Asn Gin Glu Thr Pro Thr Pro Asp Lys 
65 70 75 80 

Pro Thr Pro Glu Pro Gly Thr Pro Lys Thr Glu Thr Pro Val Asn Pro 
85 90 95 

Asp Pro Glu Val Pro Thr Tyr Glu Thr Gly Lys Arg Glu Glu Leu Pro 
100 105 110 

Asn Thr Gly Thr Glu Ala Asn 
115 

(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1879 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 
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CGATGGGCTC 


AATCCAACCC 


CAGGTCAAGT 


CTTACCTGAA 


GAGACATCGG 


GAACGAAAGA 


60 


GGGTGACTTA 


TCAGAAAAAC 


CAGGAGACAC 


CGTTCTCACT 


CAAGCGAAAC 


CTGAGGGCGT 


120 


TACTGGAAAT 


ACGAATTCAC 


TTCCGACACC 


TACAGAAAGA 


ACTGAAGTGA 


GCGAGGAAAC 


180 


AAGCCCTTCT 


AGTCTGGATA 


CACTTTTTGA 


AAAAGATGAA 


GAAGCTCAAA 


AAAATCCAGA 


240 


GCTAACAGAT 


GTCTTAAAAG 


AAACTGTAGA 


TACAGCTGAT 


GTGGATGGGA 


CACAAGCAAG 


300 


TCCAGCAGAA 


ACTACTCCTG 


AACAAGTAAA 


AGGTGGAGTG 


AAAGAAAATA 


CAAAAGACAG 


360 


CATCGATGTT 


CCTGCTGCTT 


ATCTTGAAAA 


AGCTGAAGGG 


AAAGGTCCTT 


TCACTGCCGG 


420 


TGTAAACCAA 


GTAATTCCTT 


ATGAACTATT 


CGCTGGTGAT 


GGTATGTTAA 


CTCGTCTATT 


480 


ACTAAAAGCT 


TCGGATAATG 


CTCCTTGGTC 


TGACAATGGT 


ACTGCTAAAA 


ATCCTGCTTT 


540 


ACCTCCTCTT 


GAAGGATTAA 


CAAAAGGGAA 


ATACTTCTAT 


GAAGTAGACT 


TAAATGGCAA 


600 


TACTGTTGGT 


AAACAAGGTC 


AAGCTTTAAT 


TGATCAACTT 


CGCGCTAATG 


GTACTCAAAC 


660 


TTATAAAGCT 


ACTGTTAAAG 


TTTACGGAAA 


TAAAGACGGT 


AAAGCTGACT 


TGACTAATCT 


720 


AGTTGCTACT 


AAAAATGTAG 


ACATCAACAT 


CAATGGATTA 


GTTGCTAAAG 


AAACAGTTCA 


780 


AAAAGCCGTT 


GCAGACAACG 


TTAAAGACAG 


TATCGATGTT 


CCAGCAGCCT 


ACCTAGAAAA 


840 


AGCCAAGGGT 


GAAGGTCCAT 


TCACAGCAGG 


TGTCAACCAT 


GTGATTCCAT 


ACGAACTCTT 


900 


CGCAGGTGAT 


GGCATGTTGA 


CTCGTCTCTT 


GCTCAAGGCA 


TCTGACAAGG 


CACCATGGTC 


960 


AGATAACGGC 


GACGCTAAAA 


ACCCAGCCCT 


ATCTCCACTA 


GGCGAAAACG 


TGAAGACCAA 


1020 


AGGTCAATAC 


TTCTATCAAN 


TAGCCTTGGA 


CGGAAATGTA 


GCTGGCAAAG 


AAAAACAAGC 


1080 


GCTCATTGAC 


CAGTTCCGAG 


CAAANGGTAC 


TCAAACTTAC 


AGCGCTACAG 


TCAATGTCTA 


1140 


TGGTAACAAA 


GACGGTAAAC 


CAGACTTGGA 


CAACATCGTA 


GCAACTAAAA 


AAGTCACTAT 


1200 


TAACATAAAC 


GGTTTAATTT 


CTAAAGAAAC 


AGTTCAAAAA 


GCCGTTGCAG 


ACAACGTTAA 


1260 


NGACAGTATC 


GATGTTCCAG 


CAGCCTACCT 


AGAAAAAGCC 


AAGGGTGAAG 


GTCCATTCAC 


1320 


AGCAGGTGTC 


AACCATGTGA 


TTCCATACGA 


ACTCTTCGCA 


GGTGATGGTA 


TGTTGACTCG 


1380 


TCTCTTGCTC 


AAGGCATCTG 


ACAAGGCACC 


ATGGTCAGAT 


AACGGNGACG 


CTAAAAACCC 


1440 


AGCNCTATCT 


CCACTAGGTG 


AAAACGTGAA 


GACCAAAGGT 


CAATACTTCT 


ATCAANTAGC 


1500 


CTTGGACGGA 


AATGTAGCTG 


GCAAAGAAAA 


ACAAGCGCTC 


ATTGACCAGT 


TCCGAGCAAA 


1560 


CGGTACTCAA 


ACTTACAGCG 


CTACAGTCAA 


TGTCTATGGT 


AACAAAGACG 


GTAAACCAGA 


1620 


CTTGGACAAC 


ATCGTAGCAA 


CTAAAAAAGT 


CACTATTAAG 


ATAAATGTTA 


AAGAAACATC 


1680 


AGACACAGCA 


AATGGTTCAT 


TATCACCTTC 


TAACTCTGGT 


TCTGGCGTGA 


CTCCGATGAA 


1740 


TCACAATCAT 


GCTACAGGTA 


CTACAGATAG 


CATGCCTGCT 


GACACCATGA 


CAAGTTCTAC 


1800 


CAACACGATG 


GCAGGTGAAA 


ACATGGCTGC 


TTCTGCTAAC 


AAGATGTCTG 


ATACGATGAT 


1860 


GTCAGAGGAT 


AAAGCTATG 










1879 
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INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 626 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 

Asp Gly Leu Asn Pro Thr Pro Gly Gin Val Leu Pro Glu Glu Thr Ser 
15 10 15 

Gly Thr Lys Glu Gly Asp Leu Ser Glu Lys Pro Gly Asp Thr Val Leu 
20 25 30 

Thr Gin Ala Lys Pro Glu Gly Val Thr Gly Asn Thr Asn Ser Leu Pro 
35 40 45 . 

Thr Pro Thr Glu Arg Thr Glu Val Ser Glu Glu Thr Ser Pro Ser Ser 
50 55 60 

Leu Asp Thr Leu Phe Glu Lys Asp Glu Glu Ala Gin Lys Asn Pro Glu 
65 70 75 80 

Leu Thr Asp Val Leu Lys Glu Thr Val Asp Thr Ala Asp Val Asp Gly 
85 90 95 

Thr Gin Ala Ser Pro Ala Glu Thr Thr Pro Glu Gin Val Lys Gly Gly 
100 105 110 

Val Lys Glu Asn Thr Lys Asp Ser lie Asp Val Pro Ala Ala Tyr Leu 
115 120 125 

Glu Lys Ala Glu Gly Lys Gly Pro Phe Thr Ala Gly Val Asn Gin Val 
130 135 140 

lie Pro Tyr Glu Leu Phe Ala Gly Asp Gly Met Leu Thr Arg Leu Leu 
145 150 155 160 

Leu Lys Ala Ser Asp Asn Ala Pro Trp Ser Asp Asn Gly Thr Ala Lys 
165 170 175 

Asn Pro Ala Leu Pro Pro Leu Glu Gly Leu Thr Lys Gly Lys Tyr Phe 
180 185 190 

Tyr Glu Val Asp Leu Asn Gly Asn Thr Val Gly Lys Gin Gly Gin Ala 
195 200 205 

Leu lie Asp Gin Leu Arg Ala Asn Gly Thr Gin Thr Tyr Lys Ala Thr 
210 215 220 

Val Lys Val Tyr Gly Asn Lys Asp Gly Lys Ala Asp Leu Thr Asn Leu 
225 230 235 . 240 

Val Ala Thr Lys Asn Val Asp lie Asn lie Asn Gly Leu Val Ala Lys 
245 250 255 

Glu Thr Val Gin Lys Ala Val Ala Asp Asn Val Lys Asp Ser lie Asp 
260 265 270 
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Val Pro Ala Ala Tyr Leu Glu Lys Ala Lys Gly Glu Gly Pro Phe Thr 
275 280 285 

Ala Gly Val Asn His Val lie Pro Tyr Glu Leu Phe Ala Gly Asp Gly 
290 295 300 

Met Leu Thr Arg Leu Leu Leu Lys Ala Ser Asp Lys Ala Pro Trp Ser 
305 310 315 320 

Asp Asn Gly Asp Ala Lys Asn Pro Ala Leu Ser Pro Leu Gly Glu Asn 
325 330 335 

Val Lys Thr Lys Gly Gin Tyr Phe Tyr Gin Xaa Ala Leu Asp Gly Asn 
340 345 350 

Val Ala Gly Lys Glu Lys Gin Ala Leu lie Asp Gin Phe Arg Ala Xaa 
355 360 365 

Gly Thr Gin Thr Tyr Ser Ala Thr Val Asn Val Tyr Gly Asn Lys Asp 
370 375 380 

Gly Lys Pro Asp Leu Asp Asn lie Val Ala Thr Lys Lys Val Thr lie 
385 390 395 400 

Asn lie Asn Gly Leu lie Ser Lys Glu Thr Val Gin Lys Ala Val Ala 
405 410 415 

Asp Asn Val Xaa Asp Ser lie Asp Val Pro Ala Ala Tyr Leu Glu Lys 
.420 425 430 

Ala Lys Gly Glu Gly Pro Phe Thr Ala Gly Val Asn His Val lie Pro 
435 440 445 

Tyr Glu Leu Phe Ala Gly Asp Gly Met Leu Thr Arg Leu Leu Leu Lys 
450 455 460 

Ala Ser Asp Lys Ala Pro Trp Ser Asp Asn Gly Asp Ala Lys Asn Pro 
465 470 475 480 

Ala Leu Ser Pro Leu Gly Glu Asn Val Lys Thr Lys Gly Gin Tyr Phe 
485 490 495 

Tyr Gin Xaa Ala Leu Asp Gly Asn Val Ala Gly Lys Glu Lys Gin Ala 
500 505 510 

Leu lie Asp Gin Phe Arg Ala Asn Gly Thr Gin Thr Tyr Ser Ala Thr 
515 520 525 

Val Asn Val Tyr Gly Asn Lys Asp Gly Lys Pro Asp Leu Asp Asn lie 
530 535 540 

Val Ala Thr Lys Lys Val Thr lie Lys lie Asn Val Lys Glu Thr Ser 
545 550 555 560 

Asp Thr Ala Asn Gly Ser Leu Ser Pro Ser Asn Ser Gly Ser Gly Val 
565 570 575 

Thr Pro Met Asn His Asn His Ala Thr Gly Thr Thr Asp Ser Met Pro 
580 585 590 

Ala Asp Thr Met Thr Ser Ser Thr Asn Thr Met Ala Gly Glu Asn Met 
595 600 605 
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Ala Ala Ser Ala Asn Lys Met Ser Asp Thr Met Met Ser Glu Asp Lys 
610 615 620 

Ala Met 
625 

(2) INFORMATION FOR SEQ ID NO: 107: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 593 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNES S : doubl e 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 



TTCCAATCAA 


AAACAGGCAG 


ATGGTAAACT 


CAATATCGTG 


ACAACCTTTT 


ACCCTGTCTA 


60 


TGArTTTACC 


AAGCAAGTCG 


CAGGAGATAC 


GGCTAATGTA 


GAACTCCTAA 


TCGGTGCTGG 


120 


GACAGAACCT 


CATGAATACG 


AACCATCTGC 


CAAGGCAGTT 


GCCAAAATCC 


AAGATGCAGA 


180 


TACCTTCGTT 


TATGAAAATG 


AAAACATGGA 


AACATGGGTA 


CCTAAATTGC 


TAGATACCTT 


240 


GGATAAGAAA 


AAAGTGAAAA 


CCATCAAGGC 


GACAGGCGAT 


ATGTTGCTCT 


TGCCAGGTGG 


300 


CGAGGAAGAA 


GAGGGAGACC 


ATGACCATGG 


AGAAGAAGGT 


CATCACCATG 


AGTTTGACCC 


360 


CCATGTTTGG 


TTATCACCAG 


TTCGTGCCAT 


tAAACTAGTA 


GAGCACCATC 


CGCGACACTT 


420 


GTCAGCAGAT 


TATCCTGATA 


AAAAAGAGAC 


CTTTGAGAAG 


AATGCAGCTG 


CCTATATCGA 


480 


AAAATTGCAA 


GCCTTGGATA 


AGGCTTACGC 


AGAAGGTTTG 


TCTCAAGCAA 


AACAAAAGAG 


540 


CTTTGTGACT 


CAACACGCAg 


CCTTTAACTa 


TCTTGCCTTG 


GACTATGGGA 


CTC 


593 



(2) INFORMATION FOR SEQ ID NO:108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 197 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 

Ser Asn Gin Lys Gin Ala Asp Gly Lys Leu Asn lie Val Thr Thr Phe 
15 10 15 

Tyr Pro Val Tyr Glu Phe Thr Lys Gin Val Ala Gly Asp Thr Ala Asn 
20 25 30 

Val Glu Leu Leu lie Gly Ala Gly Thr Glu Pro His Glu Tyr Glu Pro 
35 40 45 

Ser Ala Lys Ala Val Ala Lys lie Gin Asp Ala Asp Thr Phe Val Tyr 
50 55 60 

Glu Asn Glu Asn Met Glu Thr Trp Val Pro Lys Leu Leu Asp Thr Leu 
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65 70 75 80 

Asp Lys Lys Lys Val Lys Thr lie Lys Ala Thr Gly Asp Met Leu Leu 
85 90 95 

Leu Pro Gly Gly Glu Glu Glu Glu Gly Asp His Asp His Gly Glu Glu 
100 105 110 

Gly His His His Glu Phe Asp Pro His Val Trp Leu Ser Pro Val Arg 
115 120 125 

Ala lie Lys Leu Val Glu His His Pro Arg His Leu Ser Ala Asp Tyr 
130 135 140 

Pro Asp Lys Lys Glu Thr Phe Glu Lys Asn Ala Ala Ala Tyr lie Glu 
145 150 155 160 

Lys Leu Gin Ala Leu Asp Lys Ala Tyr Ala Glu Gly Leu Ser Gin Ala 
165 170 175 

Lys Gin Lys Ser Phe Val Thr Gin His Ala Ala Phe Asn Tyr Leu Ala 
180 185 190 

Leu Asp Tyr Gly Thr 
195 

(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1003 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 



TATCACAGGA 


TCGAACGGTA 


AGACAACCAC 


AACGACTATG 


ATTGGGGAAG 


TTTTGACTGC 


60 


TGCTGGCCAA 


CATGGTCTTT 


TATCAGGGAA 


TATCGGCTAT 


CCAGCTAGTC 


AGGTTGCTCA 


120 


AATAGCATCA 


GATAAGGACA 


CGCTTGTTAT 


GGAACTTTCT 


TCTTTCCAAC 


TCATGGGTGT 


180 


TCAAGAATTC 


CATCCAGAGA 


TTGCGGTTAT 


TACCAACCTC 


ATGCCAACTC 


ATATCGACTA 


240 


CCATGGGTCA 


TTTTCGGAAT 


ATGTAGCAGC 


CAAGTGGAAT 


ATCCAGAACA 


AGATGACAGC 


300 


AGCTGATTTC 


CTTGTCTTGA 


ACTTTAATCA 


AGACTTGGCA 


AAAGACTTGA 


CTTCCAAGAC 


360 


AGAAGCCACT 


GTTGTACCAT 


TTTCAACACT 


TGAAAAGGTT 


GATGGAGCTT 


ATCTGGAAGA 


420 


TGGTCAACTC 


TACTTCCGTG 


GTGAAGTAGT 


CATGGCAGCG 


AATGAAATCG 


GTGTTCCAGG 


480 


TAGCCACAAT 


GTGGAAAATG 


CCCTTGCGAC 


TATTGCTGTA 


GCCAAGCTTC 


GTGATGTGGA 


540 


CAATCAAACC 


ATCAAGGAAA 


CTCTTTCAGC 


CTTCGGTGGT 


GTCAAACACC 


GTCTCCAGTT 


600 


TGTGGATGAC 


ATCAAGGGTG 


TTAAATTCTA 


TAACGACAGT 


AAATCAACTA 


ATATCTTGGC 


660 


TACTCAAAAA 


GCCTTGTCAG 


GATTTGACAA 


CAGCAAGGTC 


GTCTTGATTG 


CAGGTGGTTT 


720 


GGACCGTGGC 


AATGAGTTTG 


ACGAATTGGT 


GCCAGACATT 


ACTGGACTCA 


AGAAGATGGT 


780 
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CATCCTGGGT CAATCTGCAG AACGTGTCAA ACGGGCAGCA GACAAGGCTG GTGTCGCTTA 840 

TGTGGAGGCG ACAGATATTG CAGATGCGAC CCGCAAGGCC TATGAGCTTG CGACTCAAGG 900 

AGATGTGGTT CTTCTTAGTC CTGCCAATGC TAGCTGGGAT ATGTATGCTA ACTTTGAAGT 960 

ACGTGGCGAC CTCTTTATCG ACACAGTAGC GGAGTTAAAA GAA 1003 
(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 335 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 

Gly lie Thr Gly Ser Asn Gly Lys Thr Thr Thr Thr Thr Met lie Gly 
15 10 15 

Glu Val Leu Thr Ala Ala Gly Gin His Gly Leu Leu Ser Gly Asn lie 
20 25 30 

Gly Tyr Pro Ala Ser Gin Val Ala Gin lie Ala Ser Asp Lys Asp Thr 
35 40 45 

Leu Val Met Glu Leu Ser Ser Phe Gin Leu Met Gly Val Gin Glu Phe 
50 55 60 

His Pro Glu lie Ala Val lie Thr Asn Leu Met Pro Thr His lie Asp 
65 70 75 80 

Tyr His Gly Ser Phe Ser Glu Tyr Val Ala Ala Lys Trp Asn lie Gin 
85 90 95 

Asn Lys Met Thr Ala Ala Asp Phe Leu Val Leu Asn Phe Asn Gin Asp 
100 105 110 

Leu Ala Lys Asp Leu Thr Ser Lys Thr Glu Ala Thr Val Val Pro Phe 
115 120 125 

Ser Thr Leu Glu Lys Val Asp Gly Ala Tyr Leu Glu Asp Gly Gin Leu 
130 135 140 

Tyr Phe Arg Gly Glu Val Val Met Ala Ala Asn Glu lie Gly Val Pro 
145 150 155 160 

Gly Ser His Asn Val Glu Asn Ala Leu Ala Thr lie Ala Val Ala Lys 
165 170 175 

Leu Arg Asp Val Asp Asn Gin Thr lie Lys Glu Thr Leu Ser Ala Phe 
180 185 190 

Gly Gly Val Lys His Arg Leu Gin Phe Val Asp Asp lie Lys Gly Val 
195 200 205 

Lys Phe Tyr Asn Asp Ser Lys Ser Thr Asn lie Leu Ala Thr Gin Lys 
210 215 220 

Ala Leu Ser Gly Phe Asp Asn Ser Lys Val Val Leu lie Ala Gly Gly 
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225 230 235 240 

Leu Asp Arg Gly Asn Glu Phe Asp Glu Leu Val Pro Asp lie Thr Gly 
245 250 255 

Leu Lys Lys Met Val lie Leu Gly Gin Ser Ala Glu Arg Val Lys Arg 
260 265 270 

Ala Ala Asp Lys Ala Gly Val Ala Tyr Val Glu Ala Thr Asp lie Ala 
275 280 285 

Asp Ala Thr Arg Lys Ala Tyr Glu Leu Ala Thr Gin Gly Asp Val Val 
290 295 300 

Leu Leu Ser Pro Ala Asn Ala Ser Trp Asp Met Tyr Ala Asn Phe Glu 
305 310 315 320 

Val Arg Gly Asp Leu Phe lie Asp Thr Val Ala Glu Leu Lys Glu 
325 330 335 

(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 412 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 111: 

AAGTTCATCG AAGATGGTTG GGAAGTCCAC TATATCGGGG ACAAGTGTGG TATCGAACAC 60 

CAAGAAATCC TTAAGTCAGG TTTGGATGTC ACCTTCCATT CTATTGCGAC TGGAAAATTG 120 

CGTCGCTATT TCTCTTGGCA AAATATGCTG GACGTCTTCA AAGTTGGTTG GGGAATTGTC 180 

CAATCGCTCT TTATCATGTT GCGACTGCGT CCACAGACCC TTTTTTCAAA GGGGGGCTTT 240 

GTCTCAGTAC CGCCTGTTAT CGCTGCGCGT GTGTCAGGAG TGCCTGTCTT TATTCACGAA 300 

TCTGACCTGT CTATGGGCTT GGCCAATAAA ATCGCCTATA AATTTGCGAC TAAGATGTAT 360 

TCAACCTTTG AACAAGCTTC GAGTTTGGCT AAGGTTGAGC ATGTGGGAGC GG 412 
(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 137 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 

Ser Ser Ser Lys Met Val Gly Lys Ser Thr lie Ser Gly Thr Ser Val 
15 10 15 

Val Ser Asn Thr Lys Lys Ser Leu Ser Gin Val Trp Met Ser Pro Ser 
20 25 30 
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He Leu Leu Arg Leu Glu Asn Cys Val Ala He Ser Leu Gly Lys lie 
35 40 45 

Cys Trp Thr Ser Ser Lys Leu Val Gly Glu Leu Ser Asn Arg Ser Leu 
50 55 60 

Ser Cys Cys Asp Cys Val His Arg Pro Phe Phe Gin Arg Gly Ala Leu 
65 70 75 80 

Ser Gin Tyr Arg Leu Leu Ser Leu Arg Val Cys Gin Glu Cys Leu Ser 
85 90 95 

Leu Phe Thr Asn Leu Thr Cys Leu Trp Ala Trp Pro He Lys Ser Pro 
100 105 110 

He Asn Leu Arg Leu Arg Cys He Gin Pro Leu Asn Lys Leu Arg Val 
115 120 125 

Trp Leu Arg Leu Ser Met Trp Glu Arg. 
130 135 

(2) INFORMATION FOR SEQ ID. NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 544 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 

ATCGCTAGCT AGTGAAATGC AAGAAAGTAC ACGTAAATTC AAGGTTACTG CTGACCTAAC 60 

AGATGCCGGT GTTGGAACGA TTGAAGTTCC TTTGAGCATT GAAGATTTAC CCAATGGGCT 120 

GACCGCTGTG GCGACTCCGC AAAAAATTAC AGTCAAGATT GGTAAGAAGG CTCAGAAGGA 180 

TAAGGTAAAG ATTGTACCAG AGATTGACCC TAGTCAAATT GATAGTCGGG TACAAATTGA 240 

AAATGTCATG GTGTCAGATA AAGAAGTGTC TATTACGAGT GACCAAGAGA CATTGGATAG 300 

AATTGATAAG ATTATCGCTG TTTTGCCAAC TAGCGAACGT ATAACAGGTA ATTACAGTGG 360 

TTCAGTACCT TTGCAGGCAA TCGACCGCAA TGGTGTTGTC TTACCGGCAG TTATCACTCC 420 

GTTTGATACA ATAATGAAGG TGACTACAAA ACCAGTAGCA CCAAGTTCAA GCACATCAAA 480 

TTCAAGTACA AGCAGTTCAT CGGAGACATC TTCGTCAACG AAAGCAACTA GTTCAAAAAC 540 

GAAT 544 
(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: ■ 

(A) LENGTH: 181 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 

Ser Leu Ala Ser Glu Met Gin Glu Ser Thr Arg Lys Phe Lys Val Thr 
15 10 15 

Ala Asp Leu Thr Asp Ala Gly Val Gly Thr lie Glu Val Pro Leu Ser 
20 25 30 

lie Glu Asp Leu Pro Asn Gly Leu Thr Ala Val Ala Thr Pro Gin Lys 
35 .40 45 

lie Thr Val Lys lie Gly Lys Lys Ala Gin Lys Asp Lys Val Lys lie 
50 55 60 

Val Pro Glu lie Asp Pro Ser Gin lie Asp Ser Arg Val Gin lie Glu 
65 70 75 80 

Asn Val Met Val Ser Asp Lys Glu Val Ser lie Thr Ser Asp Gin Glu 
85 90 95 

Thr Leu Asp Arg lie Asp Lys lie lie Ala Val Leu Pro Thr Ser Glu 
100 105 110 

Arg lie Thr Gly Asn Tyr Ser Gly Ser Val Pro Leu Gin Ala lie Asp 
115 120 125 

Arg Asn Gly Val Val Leu Pro Ala Val lie Thr Pro Phe Asp Thr lie 
130 135 140 

Met Lys Val Thr Thr Lys Pro Val Ala Pro Ser Ser Ser Thr Ser Asn 
145 150 155 160 

Ser Ser Thr Ser Ser Ser Ser Glu Thr Ser Ser Ser Thr Lys Ala Thr 
165 170 175 

Ser Ser Lys Thr Asn 
180 

(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1267 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 115: 

GCACCAGATG GGGCACAAGG TTCAGGGATC AGATGTTGAA AAGTACTACT TTACCCAACG 60 

CGGTCTTGAG CAGGCAGGAA TTACCATTCT TCCTTTTGAT GAAAAAAATC TAGACGGTGA 120 

TATGGAAATT ATCGCTGGAA ATGCCTTTCG TCCAGATAAC AACGTCGAAA TTGCCTATGC 180 

GGACCAAAAT GGTATCAGCT ACAAACGTTA CCATGAGTTT CTAGGTAGCT TTATGCGTGA 240 

CTTTGTTAGC ATGGGAGTAG CAGGAGCACA TGGAAAAACT TCAACGACAG GTATGTTGTC 300 

TCATGTCTTG TCTCACATTA CAGATACCAG CTTCTTGATT GGAGATGGGA CAGGTCGTGG 360 

TTCGGCCAAT GCCAAATATT TTGTCTTTGA ATCTGACGAA TATGAGCGTC ACTTCATGCC 420 
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TTACCACCCA 


GAATACTCTA 


TTATCACCAA 


CATTGACTTT 


GACCATCCAG 


ATTATTTCAC 


480 


AAGTCTCGAG 


GATGTTTTTA 


ATGCCTTTAA 


CGACTATGCC 


AAACAAATCA 


CCAAGGGTCT 


540 


TTTTGTCTAT 


GGTGAAGATG 


CTGAATTGCG 


TAAGATTACG 


TCTGATGCAC 


CAATTTATTA 


600 


TTATGGTTTT 


GAAGCTGAAG 


GCAATGACTT 


TGTAGCTAGT 


GATCTTCTTC 


GTTCAATAAC 


660 


TGGTTCAACC 


TTCACCGTTC 


ATTTCCGTGG 


ACAAAACTTG 


GGGCAATTCC 


ACATTCCAAC 


720 


CTTTGGTCGT 


CACAATATCA 


TGAATGCGAC 


AGCCGTTATT 


GGTCTTCTTT 


ACACAGCAGG 


780 


ATTTGATTTG 


AACTTGGTGC 


GTGAGCACTT 


GAAAACATTT 


GCCGGTGTTA 


AACGTCGTTT 


840 


CACTGAGAAA 


ATTGTCAATG 


ATACAGTGAT 


TATCGATGAC 


TTTGCCCACC 


ATCCAACAGA 


900 


AATTATTGCG 


ACCTTGGATG 


CGGCTCGTCA 


GAAATACCCA 


AGCAAGGAAA 


TTGTAGCAGT 


960 


CTTTCAACCG 


CATACCTTTA 


CAAGAACCAT 


TGCCTTGTTG 


GACGACTTTG 


CCCATGCTTT 


1020 


AAACCAAGCA 


GATGCTGTTT 


ATCTAGCGCA 


AATTTATGGC 


TCGGCTCGTG 


AAGTAGATCA 


1080 


TGGTGACGTT 


AAGGTAGAAG 


ACCTAGCCAA 


CAAAATCAAC 


AAAAAACACC 


AAGTGATTAC 


1140 


TGTTGAAAAT 


GTTTCTCCAC 


TCCTAGACCA 


TGACAATGCT 


GTTTACGTCT 


TTATGGGAGC 


1200 


AGGAGACATC 


CAAACCTATG 


AATACTCATT 


TGAGCGTCTC 


TTGTCTAACT 


TGACAAGCAA 


1260 


TGTTCAA 
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(2) INFORMATION FOR SEQ ID NO: 116: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 422 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 

His Gin Met Gly His Lys Val Gin Gly Ser Asp Val Glu Lys Tyr Tyr 
15 10 15 

Phe Thr Gin Arg Gly Leu Glu Gin Ala Gly lie Thr lie Leu Pro Phe 
20 25 30 

Asp Glu Lys Asn Leu Asp Gly Asp Met Glu lie lie Ala Gly Asn Ala 
35 40 45 

Phe Arg Pro Asp Asn Asn Val Glu lie Ala Tyr Ala Asp Gin Asn Gly 
50 55 60 

lie Ser Tyr Lys Arg Tyr His Glu Phe Leu Gly Ser Phe Met Arg Asp 
65 70 75 80 

Phe Val Ser Met Gly Val Ala Gly Ala His Gly Lys Thr Ser Thr Thr 
85 90 95 

Gly Met Leu Ser His Val Leu Ser His lie Thr Asp Thr Ser Phe Leu 
100 105 110 
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He Gly Asp Gly Thr Gly Arg Gly Ser Ala Asn Ala Lys Tyr Phe Val 
115 120 125 

Phe Glu Ser Asp Glu Tyr Glu Arg His Phe Met Pro Tyr His Pro Glu 
130 135 140 

Tyr Ser He He Thr Asn He Asp Phe Asp His Pro Asp Tyr Phe Thr 
145 150 155 160 

Ser Leu Glu Asp Val Phe Asn Ala Phe Asn Asp Tyr Ala Lys Gin He 
165 170 175 

Thr Lys Gly Leu Phe Val Tyr Gly Glu Asp Ala Glu Leu Arg Lys He 
180 185 190 

Thr Ser Asp Ala Pro He Tyr Tyr Tyr Gly Phe Glu Ala Glu Gly Asn 
195 200 205 

Asp Phe Val Ala Ser Asp Leu Leu Arg Ser He Thr Gly Ser Thr Phe 
210 215 220 

Thr Val His Phe Arg Gly Gin Asn Leu Gly Gin Phe His He Pro Thr 
225 230 235 240 

Phe Gly Arg His Asn He Met Asn Ala Thr Ala Val He Gly Leu Leu 
245 250 255 

Tyr Thr Ala Gly Phe Asp Leu Asn Leu Val Arg Glu His Leu Lys Thr 
260 265 270 

Phe Ala Gly Val Lys Arg Arg Phe Thr Glu Lys He Val Asn Asp Thr 
275 280 285 

Val He He Asp Asp Phe Ala His His Pro Thr Glu He He Ala Thr 
290 295 300 

Leu Asp Ala Ala Arg Gin Lys Tyr Pro Ser Lys Glu He Val Ala Val 
305 310 315 320 

Phe Gin Pro His Thr Phe Thr Arg Thr He Ala Leu Leu Asp Asp Phe 
325 330 335 

Ala His Ala Leu Asn Gin Ala Asp Ala Val Tyr Leu Ala Gin He Tyr 
340 345 350 

Gly Ser Ala Arg Glu Val Asp His Gly Asp Val Lys Val Glu Asp Leu 
355 360 365 

Ala Asn Lys He Asn Lys Lys His Gin Val He Thr Val Glu Asn Val 
370 375 380 

Ser Pro Leu Leu Asp His Asp Asn Ala Val Tyr Val Phe Met Gly Ala 
385 390 395 400 

Gly Asp He Gin Thr Tyr Glu Tyr Ser Phe Glu Arg Leu Leu Ser Asn 
405 410 415 

Leu Thr Ser Asn Val Gin 
420 

INFORMATION FOR SEQ ID NO: 117: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 3121 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 



TTTTAACCCA 


ACTGTTGGTA 


CTTTCCTTTT 


TACTGCAGGA 


TTGAGCTTGT 


GGTTTCTAAA 


AGGGAAAATG 


GAAAGAAACG 


ACTTGTTCAT 


TTTCTGCTGT 


GGGAGTTCAA 


TTGTTGCCGG 


CCAGTGCTTT 


TGGGTTGACC 


AGCCAGATTT 


TAATAGTCAG 


CTTTCTATCG 


GAGTCGGGGA 


ACATTTACCA 


GAGCCTCTGA 


TTATCAATAT 


ATTGGTTATA 


TCAAAACTAA 


GAAACAGGAT 


AATACAGAGC 


AGTTGATGGG 


AAATACTCTG 


CTCAAAGAGA 


TAGTCAACCA 


AACTCTACAA 


TGTAGTTCAT 


TCAGCTGATT 


TAGAATGGAA 


CCAAGGACAG 


GGGAAGGTTA 


TGAAGCATCA 


GGGGATGATG 


GACTTTCAGA 


AAAATCTTCT 


ATAGCAGCAG 


TTCTAATGAT 


TCATTCGCAA 


GTCAAGTTGA 


GCAGAATCCG 


GATCACAAAG 


AGTTCGACCA 


ACAGTGCCAG 


AACAAGGAAA 


TCCTGTGTCT 


GCTACAACGG 


GGAAGAGGAA 


GTATTGGCGA 


CGACAAATGA 


TCGACCAGAG 


TATAAACTTC 


CAAAGGCACG 


CAAGAACCCG 


GTCATGAGGG 


TGAAGCCGCA 


GTCCGTGAAG 


CTACACTAAG 


CCACTAGAAA 


CCAAAGGTAC 


ACAAGGACCC 


GGACATGAAG 


AGTTCGCGAG 


GAAGAACCAG 


CTTACACAGA 


ACCGTTAGCA 


ACGAAAGGCA 


AGGTCATGAG 


GGCAAAGCTA 


CAGTCCGCGA 


AGAGACTCTA 


GAGTACACGG 


GACAAAAGGC 


ACACAAGAAC 


CCGAACATGA 


GGGCGAaCGG 


SCAGTAGAAG 


GGCTTTAGAG 


GTCACTACAC 


GAAATAGAAC 


GGAAATCCAG 


AATATTCCTT 


AGAAATTCAG 


GATCCAACAC 


TTCTGAAAAA 


TCGTCGTAAG 


ATTGAACGAC 


AGGGACACGT 


ACAATTCAAT 


ATGAAGACTA 


CATCGTAAAT 


GGTAATGTCG 


AGAAGTGTCA 


CGAACTGAAG 


TAGCTCCGGT 


CAACGAAGTC 


GTTAAAGTAG 


GAAAGTTAAA 


CCTACAGTAG 


AAATTACAAA 


CTTAACAAAA 


GTTGAGAACA 


AACTGTAAGT 


TATAACTTAA 


TAGACACTAC 


CTCAGCATAT 


GTTTCTGCAA 


TTTCCATGGA 


GACAAGCTAG 


TTAAAGAGGT 


GGATATAGAA 


AATCCTGCCA 


AATATCAGGT 


TTAGATTACT 


ACACACCGTA 


TACAGTTAAA 


ACACACCTAA 


GGGTGAAAAT 


AATGAGGAAA 


ATACTGAAAC 


ATCAACTCAA 


GATTTCCAAT 


GAAAATAGAG 


ATTAAAGATA 


TTGATTCAGT 


AGAATTATAC 


GGTAAAGAAA 


TCGTAGATAT 


TTAAGTCTAA 


GTGAAGCGCC 


GACTGATACG 


GCTAAATACT 


GAAATCAGAT 


CGCTTCAAAG 


AAATGTACCT 


ACCTGTAAAA 


TCTATTACAG 



i 



TAGTTTTATT 60 

TGACTAGCAT 120 

TATCTGCCTA 180 

AAATCGAAGG 240 

TTTCAAGGAC 300 

AAACATCAGA 360 

GTTTACAAGG 420 

ACAATCTATC 480 

GAGAATCTGT 540 

TGCAGAGTGC 600 

CATTGGAAAC 660 

ACTTACCAGT 720 

GTGAAGCTGC 780 

CGCAAGAGCC 840 

AACCGGTAGC 900 

AAGAACTTCC 960 

ATACAACAGA 1020 

AAGGGCAAGC 1080 

TAGAAACTAA 1140 

GAACACTTGT 1200 

AAAAATCTAT 1260 

AAACGCAAGT 1320 

AAGAGCAAGT 1380 

CTTATAATTT 1440 

TAGAGTATAA 1500 

ATGATCGTTA 1560 

TTGTAAAAGT 1620 

AAAATACGGA 1680 
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TGGAACGTAT 


AAAGTGACGG 


TAGCCGTTGA 


TCAACTTGTC 


GAAGAAGGTA 


CAGACGGTTA 


1740 


CAAAGATGAT 


TACACATTTA 


CTGTAGCTAA 


ATCTAAAGCA 


GAGCAACCAG 


GAGTTTACAC 


1800 


ATCCTTTAAA 


CAGCTGGTAA 


CAGCCATGCA 


AAGCAATCTG 


TCTGGTGTCT 


ATACATTGGC 


1860 


TTCAGATATG 


ACCGCAGATG 


AGGTGAGCTT 


AGGCGATAAG 


CAGACAAGTT 


ATCTCACAGG 


1920 


TGCATTTACA 


GGGAGCTTGA 


TCGGTTCTGA 


TGGAACAAAA 


TCGTATGCCA 


TTTATGATTT 


1980 


GAAGAAACCA 


TTATTTGATA 


CATTAAATGG 


TGCTACAGTT 


AGAGATTTGG 


ATATTAAAAC 


2040 


TGTTTCTGCT 


GATAGTAAAG 


AAAATGTCGC 


AGCGCTGGCG 


AAGGCAGCGA 


ATAGCGCGAA 


2100 


TATTAATAAT 


GTTGCAGTAG 


AAGGAAAAAT 


CTCAGGTGCG 


AAATCTGTTG 


CGGGATTAGT 


2160 


AGCGAGCGCA 


ACAAATACAG 


TGATAGAAAA 


CAGCTCGTTT 


ACAGGGAAAC 


TTATCGCAAA 


2220 


TCACCAGGAC 


AGTAATAAAA 


ATGATACTGG 


AGGAATAGTA 


GGTAATATAA 


CAGGAAATAG 


2280 


TTCGAGAGTT 


AATAAAGTTA 


GGGTAGATGC 


CTTAATCTCT 


ACTAATGCAC 


GCAATAATAA 


2340 


CCAAACAGCT 


GGAGGGATAG 


TAGGTAGATT 


AGAAAATGGT 


GCATTGATAT 


CTAATTCGGT 


2400 


TGCTACTGGA 


GAAATACGAA 


ATGGTCAAGG 


ATATTCTAGA 


GTCGGAGGAA 


TAGTAGGATC 


2460 


TACGTGGCAA 


AACGGTCGAG 


TAAATAATGT 


TGTGAGTAAC 


GTAGATGTTG 


GAGATGGTTA 


2520 


TGTTATCACC 


GGTGATCAAT 


ACGCAGCAGC 


AGATGTGAAA 


AATGCAAGTA 


CATCAGTTGA 


2580 


TAATAGT^AAA 


GCAGACAGAT 


TCGCTACAAA 


ATTATCAAAA 


GACCAAATAG 


ACGCGAAAGT 


2640 


TGCTGATTAT 


GGAATCACAG 


TAACTCTTGA 


TGATACTGGG 


CAAGATTTAA 


AACGTAATCT 


2700 


AAGAGAAGTT 


GATTATACAA 


GACTAAATAA 


AGCAGAAGCT 


GAAAGAAAAG 


TAGCTTATAG 


2760 


CAACATAGAA 


AAACTGATGC 


CATTCTACAA 


TAAAGACCTA 


GTAGTTCACT 


ATGGTAACAA 


2820 


AGTAGCGACA 


ACAGATAAAC 


TTTACACTAC 


AGAATTGTTA 


GATGTTGTGC 


CGATGAAAGA 


2880 


TGATGAAGTA 


GTAACGGATA 


TTAATAATAA 


GAAAAATTCA 


ATAAATAAAG 


TTATGTTACA 


2940 


TTTCAAAGAT 


AATACAGTAG 


AATACCTAGA 


TGTAACATTC 


AAAGAAAACT 


TCATAAACAG 


3000 


TCAAGTAATC 


GAATACAATG 


TTACAGGAAA 


AGAATATATA 


TTCACACCAG 


AAGCATTTGT 


3060 


TTCAGACTAT 


ACAGCGATAA 


CGAATAACGT 


ACTAAGCGAC 


TTGCAAAATG 


TAACACTTAA 


3120 


C 
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(2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1040 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 
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Phe Asn Pro Thr Val Gly Thr Phe Leu Phe Thr Ala Gly Leu Ser Leu 
1 5 10 15 

Leu Val Leu Leu Val Ser Lys Arg Glu Asn Gly Lys Lys Arg Leu Val 
20 25 30 

His Phe Leu Leu Leu Thr Ser Met Gly Val Gin Leu Leu Pro Ala Ser 
35 40 45 

Ala Phe Gly Leu Thr Ser Gin lie Leu Ser Ala Tyr Asn Ser Gin Leu 
50 55 60 

Ser lie Gly Val Gly Glu His Leu Pro Glu Pro Leu Lys lie Glu Gly 
65 70 75 80 

Tyr Gin Tyr lie Gly Tyr lie Lys Thr Lys Lys Gin Asp Asn Thr Glu 
85 90 95 

Leu Ser Arg Thr Val Asp Gly Lys Tyr Ser Ala Gin Arg Asp Ser Gin 
100 105 110 

Pro Asn Ser Thr Lys Thr Ser Asp Val Val His Ser Ala Asp Leu Glu 
115 120 125 

Trp Asn Gin Gly Gin Gly Lys Val Ser Leu Gin Gly Glu Ala Ser Gly 
130 135 140 

Asp Asp Gly Leu Ser Glu Lys Ser Ser lie Ala Ala Asp Asn Leu Ser 
145 150 155 160 

Ser Asn Asp Ser Phe Ala Ser Gin Val Glu Gin Asn Pro Asp His Lys 
165 170 175 

Gly Glu Ser Val Val Arg Pro Thr Val Pro Glu Gin Gly Asn Pro Val 
180 185 190 

Ser Ala Thr Thr Val Gin Ser Ala Glu Glu Glu Val Leu Ala Thr Thr 
195 200 205 

Asn Asp Arg Pro Glu Tyr Lys Leu Pro Leu Glu Thr Lys Gly Thr Gin 
210 215 220 

Glu Pro Gly His Glu Gly Glu Ala Ala Val Arg Glu Asp Leu Pro Val 
225 230 235 240 

Tyr Thr Lys Pro Leu Glu Thr Lys Gly Thr Gin Gly Pro Gly His Glu 
245 250 255 

Gly Glu Ala Ala Val Arg Glu Glu Glu Pro Ala Tyr Thr Glu Pro Leu 
260 265 270 

Ala Thr Lys Gly Thr Gin Glu Pro Gly His Glu Gly Lys Ala Thr Val 
275 280 285 

Arg Glu Glu Thr Leu Glu Tyr Thr Glu Pro Val Ala Thr Lys Gly Thr 
290 295 300 

Gin Glu Pro Glu His Glu Gly Glu Arg Xaa Val Glu Glu Glu Leu Pro 
305 310 315 320 

Ala Leu Glu Val Thr Thr Arg Asn Arg Thr Glu lie Gin Asn lie Pro 
325 330 335 

Tyr Thr Thr Glu Glu lie Gin Asp Pro Thr Leu Leu Lys Asn Arg Arg 
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340 345 350 

Lys lie Glu Arg Gin Gly Gin Ala Gly Thr Arg Thr lie Gin Tyr Glu 
355 360 365 

Asp Tyr He Val Asn Gly Asn Val Val Glu Thr Lys Glu Val Ser Arg 
370 375 380 

Thr Glu Val Ala Pro Val Asn Glu Val Val Lys Val Gly Thr Leu Val 
385 390 395 400 

Lys Val Lys Pro Thr Val Glu He Thr Asn Leu Thr Lys Val Glu Asn 
405 410 415 

Lys Lys Ser He Thr Val Ser Tyr Asn Leu He Asp Thr Thr Ser Ala 
420 425 430 

Tyr Val Ser Ala Lys Thr Gin Val Phe His Gly Asp Lys Leu Val Lys 
435 440 445 

Glu Val Asp He Glu Asn Pro Ala Lys Glu Gin Val He Ser Gly Leu 
450 455 460 

Asp Tyr Tyr Thr Pro Tyr Thr Val Lys Thr His Leu Thr Tyr Asn Leu 
465 470 475 480 

Gly Glu Asn Asn Glu Glu Asn Thr Glu Thr Ser Thr Gin Asp Phe Gin 
485 490 495 

Leu Glu Tyr Lys Lys He Glu He Lys Asp He Asp Ser Val Glu Leu 
500 505 510 

Tyr Gly Lys Glu Asn Asp Arg Tyr Arg Arg Tyr Leu Ser Leu Ser Glu 
515 520 525 

Ala Pro Thr Asp Thr Ala Lys Tyr Phe Val Lys Val Lys Ser Asp Arg 
530 535 540 

Phe Lys Glu Met Tyr Leu Pro Val Lys Ser He Thr Glu Asn Thr Asp 
545 550 555 560 

Gly Thr Tyr Lys Val Thr Val Ala Val Asp Gin Leu Val Glu Glu Gly 
565 570 575 

Thr Asp Gly Tyr Lys Asp Asp Tyr Thr Phe Thr Val Ala Lys Ser Lys 
580 585 590 

Ala Glu Gin Pro Gly Val Tyr Thr Ser Phe Lys Gin Leu Val Thr Ala 
595 600 605 

Met Gin Ser Asn Leu Ser Gly Val Tyr Thr Leu Ala Ser Asp Met Thr 
610 615 620 

Ala Asp Glu Val Ser Leu Gly Asp Lys Gin Thr Ser Tyr Leu Thr Gly 
625 630 635 640 

Ala Phe Thr Gly Ser Leu He Gly Ser Asp Gly Thr Lys Ser Tyr Ala 
645 650 655 

He Tyr Asp Leu Lys Lys Pro Leu Phe Asp Thr Leu Asn Gly Ala Thr 
660 665 670 

Val Arg Asp Leu Asp He Lys Thr Val Ser Ala Asp Ser Lys Glu Asn 
675 680 685 
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Val Ala Ala Leu Ala Lys Ala Ala Asn Ser Ala Asn lie Asn Asn Val 
690 695 700 

Ala Val Glu Gly Lys He Ser Gly Ala Lys Ser Val Ala Gly Leu Val 
705 710 715 720 

Ala Ser Ala Thr Asn Thr Val He Glu Asn Ser Ser Phe Thr Gly Lys 
725 730 735 

Leu He Ala Asn His Gin Asp Ser Asn Lys Asn Asp Thr Gly Gly He 
740 745 750 

Val Gly Asn He Thr Gly Asn Ser Ser Arg Val Asn Lys Val Arg Val 
755 760 765 

Asp Ala Leu He Ser Thr Asn Ala Arg Asn Asn Asn Gin Thr Ala Gly 
770 775 780 

Gly He Val Gly Arg Leu Glu Asn Gly Ala Leu He Ser Asn Ser Val 
785 790 795 800 

Ala Thr Gly Glu He Arg Asn Gly Gin Gly Tyr Ser Arg Val Gly Gly 
805 810 815 

t. 

He Val Gly Ser Thr Trp Gin Asn Gly Arg Val Asn Asn Val Val Ser 
820 825 830 

Asn Val Asp Val Gly Asp Gly Tyr Val He Thr Gly Asp Gin Tyr Ala 
835 840 845 

Ala Ala Asp Val Lys Asn Ala Ser Thr Ser Val Asp Asn Arg Lys Ala 
850 855 860 

Asp Arg Phe Ala Thr Lys Leu Ser. Lys Asp Gin He Asp Ala Lys Val 
865 870 875 880 

Ala Asp Tyr Gly He Thr Val Thr Leu Asp Asp Thr Gly Gin Asp Leu 
885 890 895 

Lys Arg Asn Leu Arg Glu Val Asp Tyr Thr Arg Leu Asn Lys Ala Glu 
900 905 910 

Ala Glu Arg Lys Val Ala Tyr Ser Asn He Glu Lys Leu Met Pro Phe 
915 920 925 

Tyr Asn Lys Asp Leu Val Val His Tyr Gly Asn Lys Val Ala Thr Thr 
930 935 940 

Asp Lys Leu Tyr Thr Thr Glu Leu Leu Asp Val Val Pro Met Lys Asp 
945 950 955 960 

Asp Glu Val Val Thr Asp He Asn Asn Lys Lys Asn Ser He Asn Lys 
965 970 975 

Val Met Leu His Phe Lys Asp Asn Thr Val Glu Tyr Leu Asp Val Thr 
980 985 990 

Phe Lys Glu Asn Phe He Asn Ser Gin Val He Glu Tyr Asn Val Thr 
995 1000 1005 

Gly Lys Glu Tyr He Phe Thr Pro Glu Ala Phe Val Ser Asp Tyr Thr 
1010 1015 1020 
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Ala lie Thr Asn Asn Val Leu Ser Asp Leu Gin Asn Val Thr Leu Asn 
1025 1030 1035 1040 

(2) INFORMATION FOR SEQ ID NO: 119: • ^ 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1567 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 



TTTTAACCCA 


ACTGTTGGTA 


CTTTCCTTTT 


TACTGCAGGA 


TTGAGCTTGT 


TAGTTTTATT 


60 


GGTTTCTAAA 


AGGGAAAATG 


GAAAGAAACG 


ACTTGTTCAT 


TTTCTGCTGT 


TGACTAGCAT 


120 


GGGAGTTCAA 


TTGTTGCCGG 


CCAGTGCTTT 


TGGGTTGACC 


AGCCAGATTT 


TATCTGCCTA 


180 


TAATAGTCAG 


CTTTCTATCG 


GAGTCGGGGA 


ACATTTACCA 


GAGCCTCTGA 


AAATCGAAGG 


240 


TTATCAATAT 


ATTGGTTATA 


TCAAAACTAA 


GAAACAGGAT 


AATACAGAGC 


TTTCAAGGAC 


300 


AGTTGATGGG 


AAATACTCTG 


CTCAAAGAGA 


TAGTCAACCA 


AACTCTACAA 


AAACATCAGA 


360 


TGTAGTTCAT 


TCAGCTGATT 


TAGAATGGAA 


CCAAGGACAG 


GGGAAGGTTA 


GTTTACAAGG 


420 


TGAAGCATCA 


GGGGATGATG 


GACTTTCAGA 


AAAATCTTCT 


ATAGCAGCAG 


ACAATCTATC 


480 


TTCTAATGAT 


TCATTCGCAA 


GTCAAGTTGA 


GCAGAATCCG 


GATCACAAAG 


GAGAATCTGT 


540 


AGTTCGACCA 


ACAGTGCCAG 


AACAAGGAAA 


TCCTGTGTCT 


GCTACAACGG 


TGCAGAGTGC 


600 


GGAAGAGGAA 


GTATTGGCGA 


CGACAAATGA 


TCGACCAGAG 


TATAAACTTC 


CATTGGAAAC 


660 


CAAAGGCACG 


CAAGAACCCG 


GTCATGAGGG 


TGAAGCCGCA 


GTCCGTGAAG 


ACTTACCAGT 


720 


CTACACTAAG 


CCACTAGAAA 


CCAAAGGTAC 


ACAAGGACCC 


GGACATGAAG 


GTGAAGCTGC 


780 


AGTTCGCGAG 


GAAGAACCAG 


CTTACACAGA 


ACCGTTAGCA 


ACGAAAGGCA 


CGCAAGAGCC 


840 


AGGTCATGAG 


GGCAAAGCTA 


CAGTCCGCGA 


AGAGACTCTA 


GAGTACACGG 


AACCGGTAGC 


900 


GACAAAAGGC 


ACACAAGAAC 


CCGAACATGA 


GGGCGAaCGG 


sCAGTAGAAG 


AAGAACTTCC 


960 


GGCTTTAGAG 


GTCACTACAC 


GAAATAGAAC 


GGAAATCCAG 


AATATTCCTT 


ATACAACAGA 


1020 


AGAAATTCAG 


GATCCAACAC 


TTCTGAAAAA 


TCGTCGTAAG 


ATTGAACGAC 


AAGGGCAAGC 


1080 


AGGGACACGT 


ACAATTCAAT 


ATGAAGACTA 


CATCGTAAAT 


GGTAATGTCG 


TAGAAACTAA 


1140 


AGAAGTGTCA 


CGAACTGAAG 


TAGCTCCGGT 


CAACGAAGTC 


GTTAAAGTAG 


GAACACTTGT 


1200 


GAAAGTTAAA 


CCTACAGTAG 


AAATTACAAA 


CTTAACAAAA 


GTTGAGAACA 


AAAAATCTAT 


1260 


AACTGTAAGT 


TATAACTTAA 


TAGACACTAC 


CTCAGCATAT 


GTTTCTGCAA 


AAACGCAAGT 


1320 


TTTCCATGGA 


GACAAGCTAG 


TTAAAGAGGT 


GGATATAGAA 


AATCCTGCCA 


AAGAGCAAGT 


1380 


AATATCAGGT 


TTAGATTACT 


ACACACCGTA 


TACAGTTAAA 


ACACACCTAA 


CTTATAATTT 


1440 


GGGTGAAAAT 


AATGAGGAAA 


ATACTGAAAC 


ATCAACTCAA 


GATTTCCAAT 


TAGAGTATAA 


1500 
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GAAAATAGAG ATTAAAGATA TTGATTCAGT AGAATTATAC GGTAAAGAAA ATGATCGTTA 1560 
TCGTAGA 1567 
(2) INFORMATION FOR SEQ ID NO: 120: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 522 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 

Phe Asn Pro Thr Val Gly Thr Phe Leu Phe Thr Ala Gly Leu Ser Leu 
1 5 10 15 

Leu Val Leu Leu Val Ser Lys Arg Glu Asn Gly Lys Lys Arg Leu Val 
20 25 30 

His Phe Leu Leu Leu Thr Ser Met Gly Val Gin Leu Leu Pro Ala Ser 
35 40 45 

Ala Phe Gly Leu Thr Ser Gin lie Leu Ser Ala Tyr Asn Ser Gin Leu 
50 55 60 

Ser lie Gly Val Gly Glu His Leu Pro Glu Pro Leu Lys lie Glu Gly 
65 70 75 80 

Tyr Gin Tyr lie Gly Tyr lie Lys Thr Lys Lys Gin Asp Asn Thr Glu 
85 90 95 

Leu Ser Arg Thr Val Asp Gly Lys Tyr Ser Ala Gin Arg Asp Ser Gin 
100 105 110 

Pro Asn Ser Thr Lys Thr Ser Asp Val Val His Ser Ala Asp Leu Glu 
115 120 125 

Trp Asn Gin Gly Gin Gly Lys Val Ser Leu Gin Gly Glu Ala Ser Gly 
130 135 140 

Asp Asp Gly Leu Ser Glu Lys Ser Ser lie Ala Ala Asp Asn Leu Ser 
145 150 155 160 

Ser Asn Asp Ser Phe Ala Ser Gin Val Glu Gin Asn Pro Asp His Lys 
165 170 175 

Gly Glu Ser Val Val Arg Pro Thr Val Pro Glu Gin Gly Asn Pro Val 
180 185 190 

Ser Ala Thr Thr Val Gin Ser Ala Glu Glu Glu Val Leu Ala Thr Thr 
195 200 205 

Asn Asp Arg Pro Glu Tyr Lys Leu Pro Leu Glu Thr Lys Gly Thr Gin 
210 215 220 

Glu Pro Gly His Glu Gly Glu Ala Ala Val Arg Glu Asp Leu Pro Val 
225 230 235 240 

Tyr Thr Lys Pro Leu Glu Thr Lys Gly Thr Gin Gly Pro Gly His Glu 
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245 250 255 

Gly Glu Ala Ala Val Arg Glu Glu Glu Pro Ala Tyr Thr Glu Pro Leu 
260 265 270 

Ala Thr Lys Gly Thr Gin Glu Pro Gly His Glu Gly Lys Ala Thr Val 
275 280 285 

Arg Glu Glu Thr Leu Glu Tyr Thr Glu Pro Val Ala Thr. Lys Gly Thr 
290 295 300 

Gin Glu Pro Glu His Glu Gly Glu Arg Xaa Val Glu Glu Glu Leu Pro 
305 310 315 320 

Ala Leu Glu Val Thr Thr Arg Asn Arg Thr Glu lie Gin Asn lie Pro 
325 330 335 

Tyr Thr Thr Glu Glu lie Gin Asp Pro Thr Leu Leu Lys Asn Arg Arg 
340 345 350 

Lys lie Glu Arg Gin Gly Gin Ala Gly Thr Arg Thr lie Gin Tyr Glu 
355 360 365 

Asp Tyr lie Val Asn Gly Asn Val Val Glu Thr Lys Glu Val Ser Arg 
370 375 380 

Thr Glu Val Ala Pro Val Asn Glu Val Val Lys Val Gly Thr Leu Val 
385 390 395 400 

Lys Val Lys Pro Thr Val Glu lie Thr Asn Leu Thr Lys Val Glu Asn 
405 410 415 

Lys Lys Ser lie Thr Val Ser Tyr Asn Leu lie Asp Thr Thr Ser Ala 
420 425 430 

Tyr Val Ser Ala Lys Thr Gin Val Phe His Gly Asp Lys Leu Val Lys 
435 440 445 

Glu Val Asp lie Glu Asn Pro Ala Lys Glu Gin Val lie Ser Gly Leu 
450 455 460 

Asp Tyr Tyr Thr Pro Tyr Thr Val Lys Thr His Leu Thr Tyr Asn Leu 
465 470 475 480 

Gly Glu Asn Asn Glu Glu Asn Thr Glu Thr Ser Thr Gin Asp Phe Gin 
485 490 495 

Leu Glu Tyr Lys Lys lie Glu lie Lys Asp lie Asp Ser Val Glu Leu 
500 505 510 

Tyr Gly Lys Glu Asn Asp Arg Tyr Arg Arg 
515 520 

INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1561 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 
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TCGTAGATAT TTAAGTCTAA GTGAAGCGCC GACTGATACG GCTAAATACT TTGTAAAAGT 60 

GAAATCAGAT CGCTTCAAAG AAATGTACCT ACCTGTAAAA TCTATTACAG AAAATACGGA 120 

TGGAACGTAT AAAGTGACGG TAGCCGTTGA TCAACTTGTC GAAGAAGGTA CAGACGGTTA 180 

CAAAGATGAT TACACATTTA CTGTAGCTAA ATCTAAAGCA GAGCAACCAG GAGTTTACAC 240 

ATCCTTTAAA CAGCTGGTAA CAGCCATGCA AAGCAATCTG TCTGGTGTCT ATACATTGGC 300 

TTCAGATATG ACCGCAGATG AGGTGAGCTT AGGCGATAAG CAGACAAGTT ATCTCACAGG 360 

TGCATTTACA GGGAGCTTGA TCGGTTCTGA TGGAACAAAA TCGTATGCCA TTTATGATTT 420 

GAAGAAACCA TTATTTGATA CATTAAATGG TGCTACAGTT AGAGATTTGG ATATTAAAAC 480 

TGTTTCTGCT GATAGTAAAG AAAATGTCGC AGCGCTGGCG AAGGCAGCGA ATAGCGCGAA 540 

TATTAATAAT GTTGCAGTAG AAGGAAAAAT CTCAGGTGCG AAATCTGTTG CGGGATTAGT 600 

AGCGAGCGCA ACAAATACAG TGATAGAAAA CAGCTCGTTT ACAGGGAAAC TTATCGCAAA 660 

TCACCAGGAC AGTAATAAAA ATGATACTGG AGGAATAGTA GGTAATATAA CAGGAAATAG 720 

TTCGAGAGTT AATAAAGTTA GGGTAGATGC CTTAATCTCT ACTAATGCAC GCAATAATAA 780 

CCAAACAGCT GGAGGGATAG TAGGTAGATT AGAAAATGGT GCATTGATAT CTAATTCGGT '^840 

TGCTACTGGA GAAATACGAA ATGGTCAAGG ATATTCTAGA GTCGGAGGAA TAGTAGGATC 900 

TACGTGGCAA AACGGTCGAG TAAATAATGT TGTGAGTAAC GTAGATGTTG GAGATGGTTA 960 

TGTTATCACC GGTGATCAAT ACGCAGCAGC AGATGTGAAA AATGCAAGTA CATCAGTTGA 1020 

TAATAGAAAA GCAGACAGAT TCGCTACAAA ATTATCAAAA GACCAAATAG ACGCGAAAGT 1080 

TGCTGATTAT GGAATCACAG TAACTCTTGA TGATACTGGG CAAGATTTAA AACGTAATCT 1140 

AAGAGAAGTT GATTATACAA GACTAAATAA AGCAGAAGCT GAAAGAAAAG TAGCTTATAG 1200 

CAACATAGAA AAACTGATGC CATTCTACAA TAAAGACCTA GTAGTTCACT ATGGTAACAA 1260 

AGTAGCGACA ACAGATAAAC TTTACACTAC AGAATTGTTA GATGTTGTGC CGATGAAAGA 1320 

TGATGAAGTA GTAACGGATA TTAATAATAA GAAAAATTCA ATAAATAAAG TTATGTTACA 1380 

TTTCAAAGAT AATACAGTAG AATACCTAGA TGTAACATTC AAAGAAAACT TCATAAACAG 1440 

TCAAGTAATC GAATACAATG TTACAGGAAA AGAATATATA TTCACACCAG AAGCATTTGT 1500 

TTCAGACTAT ACAGCGATAA CGAATAACGT ACTAAGCGAC TTGCAAAATG TAACACTTAA 1560 

C 1561 
(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 520 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 

Arg Arg Tyr Leu Ser Leu Ser Glu Ala Pro Thr Asp Thr Ala Lys Tyr 
15 10 15 

Phe Val Lys Val Lys Ser Asp Arg Phe Lys Glu Met Tyr Leu Pro Val 
20 25 30 

Lys Ser lie Thr Glu Asn Thr Asp Gly Thr Tyr Lys Val Thr Val Ala 
35 40 . 45 

Val Asp Gin Leu Val Glu Glu Gly Thr Asp Gly Tyr Lys Asp Asp Tyr 
50 55 60 

Thr Phe Thr Val Ala Lys Ser Lys Ala Glu Gin Pro Gly Val Tyr Thr 
65 70 75 80 

Ser Phe Lys Gin Leu Val Thr Ala Met Gin Ser Asn Leu Ser Gly Val 
85 90 95 

Tyr Thr Leu Ala Ser Asp Met Thr Ala Asp Glu Val Ser Leu Gly Asp 
100 105 110 

Lys Gin Thr Ser Tyr Leu Thr Gly Ala Phe Thr Gly Ser Leu lie Gly 
115 120 125 

Ser Asp Gly Thr Lys Ser Tyr Ala lie Tyr Asp Leu Lys Lys Pro Leu 
130 135 140 

Phe Asp Thr Leu Asn Gly Ala Thr Val Arg Asp Leu Asp lie Lys Thr 
145 150 155 160 

Val Ser Ala Asp Ser Lys Glu Asn Val Ala Ala Leu Ala Lys Ala Ala 
165 170 175 

Asn Ser Ala Asn lie Asn Asn Val Ala Val Glu Gly Lys lie Ser Gly 
180 185 190 

Ala Lys Ser Val Ala Gly Leu Val Ala Ser Ala Thr Asn Thr Val lie 
195 200 205 

Glu Asn Ser Ser Phe Thr Gly Lys Leu lie Ala Asn His Gin Asp Ser 
210 215 220 

Asn Lys Asn Asp Thr Gly Gly lie Val Gly Asn lie Thr Gly Asn Ser 
225 230 235 240 

Ser Arg Val Asn Lys Val Arg Val Asp Ala Leu lie Ser Thr Asn Ala 
245 250 255 

Arg Asn Asn Asn Gin Thr Ala Gly Gly lie Val Gly Arg Leu Glu Asn 
260 265 270 

Gly Ala Leu lie Ser Asn Ser Val Ala Thr Gly Glu lie Arg Asn Gly 
275 280 285 

Gin Gly Tyr Ser Arg Val Gly Gly lie Val Gly Ser Thr Trp Gin Asn 
290 295 300 

Gly Arg Val Asn Asn Val Val Ser Asn Val Asp Val Gly Asp Gly Tyr 
305 310 315 320 
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Val lie Thr Gly Asp Gin Tyr Ala Ala Ala Asp Val Lys Asn Ala Ser 
325 330 335 

Thr Ser Val Asp Asn Arg Lys Ala Asp Arg Phe Ala Thr Lys Leu Ser 
340 345 350 

Lys Asp Gin lie Asp Ala Lys Val Ala Asp Tyr Gly lie Thr Val Thr 
355 360 365 

Leu Asp Asp Thr Gly Gin Asp Leu Lys Arg Asn Leu Arg Glu Val Asp 
370 375 380 

Tyr Thr Arg Leu Asn Lys Ala Glu Ala Glu Arg Lys Val Ala Tyr Ser 
385 390 395 400 

Asn lie Glu Lys Leu Met Pro Phe Tyr Asn Lys Asp Leu Val Val His 
405 410 415 

Tyr Gly Asn Lys Val Ala Thr Thr Asp Lys Leu Tyr Thr Thr Glu Leu 
420 425 430 

Leu Asp Val Val Pro Met Lys Asp Asp Glu Val Val Thr Asp lie Asn 
435 440 445 

Asn Lys Lys Asn Ser lie Asn Lys Val Met Leu His Phe Lys Asp Asn 
450 455 460 

Thr Val Glu Tyr Leu Asp Val Thr Phe Lys Glu Asn Phe lie Asn Ser 
465 470 475 480 

Gin Val lie Glu Tyr Asn Val Thr Gly Lys Glu Tyr lie Phe Thr Pro 
485 490 495 

Glu Ala Phe Val Ser Asp Tyr Thr Ala lie Thr Asn Asn Val Leu Ser 
500 505 510 

Asp Leu Gin Asn Val Thr Leu Asn 
515 520 

(2) INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 850 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123: 

CTTTGGTTTT GAAGGAAGTA AGCGTGGACA ATTTGCTGTA GAAGGAATCA ATCAACTTCG 60 

TGAGCATGTA GACACTCTAT TGATTATCTC AAACAACAAT TTGCTTGAAA TTGTTGATAA 120 

GAAAACACCG CTTTTGGAGG CTCTTAGCGA AGCGGATAAC GTTCTTCGTC AAGGTGTTCA 180 

AGGGATTACC GATTTGATTA CCAATCCAGG ATTGATTAAC CTTGACTTTG CCGATGTGAA 240 

AACGGTAATG GCAAACAAAG GGAATGCTCT TATGGGTATT GGTATCGGTA GTGGAGAAGA 300 

ACGTGTGGTA GAAGCGGCAC GTAAGGCAAT CTATTCACCA CTTCTTGAAA CAACTATTGA 360 

CGGTGCTGAG GATGTTATCG TCAACGTTAC TGGTGGTCTT GACTTAACCT TGATTGAGGC 420 
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AGAAGAGGCT TCACAAATTG TGAACCAGGC AGCAGGTCAA GGAGTGAACA TCTGGCTCGG 480 

TACTTCAATT GATGAAAGTA TGCGTGATGA AATTCGTGTA ACAGTTGTTG CAACGGGTGT 540 

TCGTCAAGAC CGCGTAGAAA AGGTTGTGGC TCCACAAGCT AGATCTGCTA CTAACTACCG 600 

TGAGACAGTG AAACCAGCTC ATTCACATGG CTTTGATCGT CATTTTGATA TGGCAGAAAC 660 

AGTTGAATTG CCAAAACAAA ATCCACGTCG TTTGGAACCA ACTCAGGCAT CTGCTTTTGG 720 

TGATTGGGAT CTTCGCCGTG AATCGATTGT TCGTACAACA GATTCAGTCG TTTCTCCAGT 780 

CGAGCGCTTT GAAGCCCCAA TTTCACAAGA TGAAGATGAA TTGGATACAC CTCCATTTTT 840 

CAAAAATCGT 850 
(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 283 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 

Phe Gly Phe Glu Gly Ser Lys Arg Gly Gin Phe Ala Val Glu Gly lie 
15 10 15 

Asn Gin Leu Arg Glu His Val Asp Thr Leu Leu lie lie Ser Asn Asn 
20 25 30 

Asn Leu Leu Glu lie Val Asp Lys Lys Thr Pro Leu Leu Glu Ala Leu 
35 40 45 

Ser Glu Ala Asp Asn Val Leu Arg Gin Gly Val Gin Gly lie Thr Asp 
50 55 60 

Leu lie Thr Asn Pro Gly Leu lie Asn Leu Asp Phe Ala Asp Val Lys 
65 70 75 80 

Thr Val Met Ala Asn Lys Gly Asn Ala Leu Met Gly lie Gly lie Gly 
85 90 95 

Ser Gly Glu Glu Arg Val Val Glu Ala Ala Arg Lys Ala lie Tyr Ser 
100 105 110 

Pro Leu Leu Glu Thr Thr lie Asp Gly Ala Glu Asp Val lie Val Asn 
115 120 125 

Val Thr Gly Gly Leu Asp Leu Thr Leu lie Glu Ala Glu Glu Ala Ser 
130 135 140 

Gin lie Val Asn Gin Ala Ala Gly Gin Gly Val Asn lie Trp Leu Gly 
145 150 155 160 

Thr Ser lie Asp Glu Ser Met Arg Asp Glu lie Arg Val Thr Val Val 
165 170 175 

Ala Thr Gly Val Arg Gin Asp Arg Val Glu Lys Val Val Ala Pro Gin 
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' 180 185 190 

Ala Arg Ser Ala Thr Asn Tyr Arg Glu Thr Val Lys Pro Ala His Ser 
195 200 205 

His Gly Phe Asp Arg His Phe Asp Met Ala Glu Thr Val Glu Leu Pro 
210 215 220 

Lys Gin Asn Pro Arg Arg Leu Glu Pro Thr Gin Ala Ser Ala Phe Gly 
225 230 235 240 

Asp Trp Asp Leu Arg Arg Glu Ser lie Val Arg Thr Thr Asp Ser Val 
245 250 255 

Val Ser Pro Val Glu Arg Phe Glu Ala Pro lie Ser Gin Asp Glu Asp 
260 265 270 

Glu Leu Asp Thr Pro Pro Phe Phe Lys Asn Arg 
275 280 

(2) INFORMATION FOR SEQ ID NO: 125: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1051 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125: 

CTACTACCTC TCGAGAGAAA GTGACCTAGA GGTGACCGTT TTTGACCATG AGCAAGGTCA 60 

AGCCACCAAG GCCGCAGCAG GAATTATCAG TCCTTGGTTT TCCAAACGCC GTAATAAAGC 120 

CTGGTACAAG ATGGCGCGCT TGGGGGCTGA TTTTTATGTG GATTTATTAG CTGATTTAGA 180 

GAAATCAGGA CAAGAAATCG ACTTTTACCA GCGTTCGGGA GTCTTTCTCT TGAAAAAGGA 240 

TGAATCCAAT TTGGAAGAAC TTTATCAACT GGCCCTCCAG CGCAGAGAAG AATCTCCCTT 300 

GATAGGGCAA TTAGCCATTC TGAACCAAGC CTCAGCTAAT GAATTATTCC CTGGTTTGCA 360 

GGGATTTGAC CGCCTGCTCT ATGCTTCTGG TGGAGCGAGA GTAGATGGCC AACTTTTAGT 420 

GACTCGTTTG CTGGAAGTCA GTCATGTCAA GCTGGTCAAA GAAAAAGTGA CTCTGACACC 480 

GTTAGCATCA GGCTACCAGA TTGGTGAAGA GGAGTTTGAG CAGGTTATTT TGGCGACGGG 540 

AGCTTGGTTG GGGGACATGT TAGAGCCTTT AGGTTATGAA GTGGATGTCC GTCCTCAAAA 600 

AGGACAACTA CGAGATTATC AGCTTGCCCA AGACATGGAA GATTACCCTG TTGTCATGCC 660 

AGAAGGGGAG TGGGATTTGA TTCCCTTTGC AGGTGGGAAA TTATCCTTAG GCGCTACCCA 720 

CGAAAATGAC ATGGGATTTG ATTTGACGGT AGATGAAACC TTGCTCCAAC AAATGGAGGA 780 

GGCCACCTTG ACTCACTATC TGATTTTGGC TGAAGCTACT TCAAAATCTG AGCGTGTTGG 840 

AATCCGTGCC TACACCAGTG ATTTCTCTCC TTTCTTTGGG CAGGTGCCTG ACTTAACTGG 900 

TGTCTATGCA GCCAGTGGAC TAGGTTCATC AGGCCTCACA ACTGGTCCTA TCATTGGTTA 960 
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CCATCTAGCC CAACTGATCC AAGACAAGGA GTTGACCTTG GACCCTCTAA ATTACCCAAT 1020 ' 'J 

TGAAAACTAT GTCAAACGAG TAAAAAGCGA A 1051 \^ 

(2) INFORMATION FOR SEQ ID NO: 126: I 

(i) SEQUENCE CHARACTERISTICS: J 

(A) LENGTH: 350 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126: 

Tyr Tyr Leu Ser Arg Glu Ser Asp Leu Glu Val Thr Val Phe Asp His 
15 10 15 

Glu Gin Gly Gin Ala Thr Lys Ala Ala Ala Gly lie lie Ser Pro Trp 
20 25 30 

Phe Ser Lys Arg Arg Asn Lys Ala Trp Tyr Lys Met Ala Arg Leu Gly 
35 40 45 

Ala Asp Phe Tyr Val Asp Leu Leu Ala Asp Leu Glu Lys Ser Gly Gin 
50 55 60 

Glu lie Asp Phe Tyr Gin Arg Ser Gly Val Phe Leu Leu Lys Lys Asp 
65 70 75 80 

Glu Ser Asn Leu Glu Glu Leu Tyr Gin Leu Ala Leu Gin Arg Arg Glu 
85 90 95 

Glu Ser Pro Leu lie Gly Gin Leu Ala lie Leu Asn Gin Ala Ser Ala 
100 105 110 

Asn Glu Leu Phe Pro Gly Leu Gin Gly Phe Asp Arg Leu Leu Tyr Ala 
115 120 125 

Ser Gly Gly Ala Arg Val Asp Gly Gin Leu Leu Val Thr Arg Leu Leu 
130 135 140 

Glu Val Ser His Val Lys Leu Val Lys Glu Lys Val Thr Leu Thr Pro 
145 150 155 160 

Leu Ala Ser Gly Tyr Gin He Gly Glu Glu Glu Phe Glu Gin Val He 
165 170 175 - 

Leu Ala Thr Gly Ala Trp Leu Gly Asp Met Leu Glu Pro Leu Gly Tyr 
180 185 190 

Glu Val Asp Val Arg Pro Gin Lys Gly Gin Leu Arg Asp Tyr Gin Leu 
195 200 205 

Ala Gin Asp Met Glu Asp Tyr Pro Val Val Met Pro Glu Gly Glu Trp 
210 215 220 

Asp Leu He Pro Phe Ala Gly Gly Lys Leu Ser Leu Gly Ala Thr His 
225 230 235 240 

Glu Asn Asp Met Gly Phe Asp Leu Thr Val Asp Glu Thr Leu Leu Gin 
245 250 255 
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Gin Met Glu Glu Ala Thr Leu Thr His Tyr Leu lie Leu Ala Glu Ala 
260 265 270 

Thr Ser Lys Ser Glu Arg Val Gly lie Arg Ala Tyr Thr Ser Asp Phe 
275 280 285 

Ser Pro Phe Phe Gly Gin Val Pro Asp Leu Thr Gly Val Tyr Ala Ala 
290 295 300 

Ser Gly Leu Gly Ser Ser Gly Leu Thr Thr Gly Pro lie lie Gly Tyr 
305 310 315 320 

His Leu Ala Gin Leu lie Gin Asp Lys Glu Leu Thr Leu Asp Pro Leu 
325 330 335 

Asn Tyr Pro lie Glu Asn Tyr Val Lys Arg Val Lys Ser Glu 
340 345 350 

(2) INFORMATION FOR SEQ ID NO: 127: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 352 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 

TAAGGTCAAA AGTCAGACCG CTAAGAAAGT GCTAGAAAAG ATTGGAGCTG ACTCGGTTAT 60 

CTCGCCAGAG TATGAAATGG GGCAGTCTCT AGCACAGACC ATTCTTTTCC ATAATAGTGT 120 

TGATGTCTTT CAGTTGGATA AAAATGTGTC TATCGTGGAG ATGAAAATTC CTCAGTCTTG 180 

GGCAGGTCAA AGTCTGAGTA AATTAGACCT CCGTGGCAAA TACAATCTGA ATATTTTGGG 240 

TTTCCGAGAG CAGGAAAATT CCCCATTGGA TGTTGAATTT GGACCAGATG ACCTCTTGAA 300 

AGCAGATACC TATATTTTGG CAGTCATCAA CAACCAGTAT TTGGATACCC TA 352 
(2) INFORMATION FOR SEQ ID NO: 128: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128: 

Lys Val Lys Ser Gin Thr Ala Lys Lys Val Leu Glu Lys lie Gly Ala 
15 10 15 

Asp Ser Val lie Ser Pro Glu Tyr Glu Met Gly Gin Ser Leu Ala Gin 
20 25 30 

Thr lie Leu Phe His Asn Ser Val Asp Val Phe Gin Leu Asp Lys Asn 
35 40 45 
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Val Ser He Val Glu Met Lys He Pro Gin Ser Trp Ala Gly Gin Ser 
50 55 60 

Leu Ser Lys Leu Asp Leu Arg Gly Lys Tyr Asn Leu Asn He Leu Gly 
65 70 75 80 

Phe Arg Glu Gin Glu Asn Ser Pro Leu Asp Val Glu Phe Gly Pro Asp 
85 90 95 

Asp Leu Leu Lys Ala Asp Thr Tyr He Leu Ala Val He Asn Asn Gin 
100 105 110 

Tyr Leu Asp Thr Leu 
115 

(2) INFORMATION FOR SEQ ID NO: 129: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 247 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129: 

TGACGGGTCT CAGGATCAGA CTCAGGAAAT CGCTGAGTGT TTAGCTAGCA AGTATCCTAA 60 

TATCGTTAGA GCCATCTATC AGGAAAATAA ATGCCATGGC GGTGCGGTCA ATCGTGGCTT 120 

GGTAGAGGCT TCTGGGCGCT ATTTTAAAGT AGTTGACAGT GATGACTGGG TGGATCCTCG 180 

TGCCTACTTG AAAATTCTTG AAACTTGCAG GAACTTGAGA GCAAAGGTCA AGAGGTGGAT 240 

GTCTTTG 247 



(2) INFORMATION FOR SEQ ID NO: 130: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130: 

Asp Gly Ser Gin Asp Gin Thr Gin Glu He Ala Glu Cys Leu Ala Ser 
15 10 15 

Lys Tyr Pro Asn He Val Arg Ala He Tyr Gin Glu Asn Lys Cys His 
20 25 30 

Gly Gly Ala Val Asn Arg Gly Leu Val Glu Ala Ser Gly Arg Tyr Phe 
35 40 45 

Lys Val Val Asp Ser Asp Asp Trp Val Asp Pro Arg Ala Tyr Leu Lys 
50 55 60 

He Leu Glu Thr Cys Arg Asn Leu Arg Ala Lys Val Lys Arg Trp Met 
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65 70 75 80 

Ser Leu 



(2) INFORMATION FOR SEQ ID NO: 131: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1744 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 



TAGAGGCTTT 


GCCAAATGGT 


GGGAAGGGCA 


CGAGCGTCGA 


AAAGAGGAAC 


GCTTTGTCAA 


60 


ACAAGAAGAA 


AAAGCTCGCC 


AAAAGGCTGA 


GAAAGAGGCT 


AGATTAGAAC 


AAGAAGAGAC 


120 


TGAAAAAGCC 


TTACTCGATT 


TGCCTCCTGT 


TGATATGGAA 


ACGGGTGAAA 


TTCTGACAGA 


180 


GGAAGCTGTT 


CAAAATCTTC 


CACCTATTCC 


AGAAGAAAAG 


TGGGTGGAAC 


CAGAAATCAT 


240 


CCTGCCTCAA 


GCTGAACTTA 


AATTCCCTGA 


ACAGGAAGAT 


GACTCAGATG 


ACGAAGATGT 


300 


TCAGGTCGAT 


TTTTCAGCCA 


AAGAAGCCCT 


TGAATACAAA 


CTTCCAAGCT 


TACAACTCTT 


360 


TGCACCAGAT 


AAACCAAAAG 


ATCAGTCTAA 


AGAGAAGAAA 


ATTGTCAGAG 


AAAATATCAA 


420 


AATCTTAGAA 


GCAACCTTTG 


CTAGCTTTGG 


TATTAAGGTA 


ACAGTTGAAC 


GGGCCGAAAT 


480 


TGGGCCATCA 


GTGACCAAGT 


ATGAAGTCAA 


GCCGGCTGTT 


GGTGTAAGGG 


TCAACCGCAT 


540 


TTCCAATCTA 


TCAGATGACC 


TCGCTCTAGC 


CTTGGCTGCC 


AAAGATGTCC 


GGATTGAAGC 


600 


ACCAATCCCT 


GGGAAATCCC 


TAATCGGAAT 


TGAAGTGCCC 


AACTCCGATA 


TTGCCACTGT 


660 


ATCTTTCCGA 


GAACTATGGG 


AACAATCGCA 


AACGAAAGCA 


GAAAATTTCT 


TGGAAATTCC 


720 


TTTAGGGAAG 


GCTGTTAATG 


GAACCGCAAG 


AGCTTTTGAC 


CTTTCTAAAA 


TGCCCCACTT 


780 


GCTAGTTGCA 


GGTTCAACGG 


GTTCAGGGAA 


GTCAGTAGCA 


GTTAACGGCA 


TTATTGCTAG 


840 


CATTCTCATG 


AAGGCGAGAC 


CAGATCAAGT 


TAAATTTATG 


ATGGTCGATC 


CCAAGATGGT 


900 


TGAGTTATCT 


GTTTACAATG 


ATATTCCCCA 


CCTCTTGATT 


CCAGTCGTGA 


CCAATCCACG 


960 


CAAAGCCAGC 


AAGGCTCTGC 


AAAAGGTTGT 


GGATGAAATG 


GAAAACCGTT 


ATGAACTCTT 


1020 


TGCCAAGGTG 


GGAGTTCGGA 


ATATTGCAGG 


TTTTAATGCC 


AAGGTAGAAG 


AGTTCAATTC 


1080 


CCAGTCTGAG 


TACAAGCAAA 


TTCCGCTACC 


ATTCATTGTC 


GTGATTGTGG 


ATGAGTTGGC 


1140 


TGACCTCATG 


ATGGTGGCCA 


GCAAGGAAGT 


GGAAGATGCT 


ATCATCCGTC 


TTGGGCAGAA 


1200 


GGCGCGTGCT 


GCAGGTATCC 


ACATGATTCT 


TGCAACTCAG 


CGTCCATCTG 


TTGATGTCAT 


1260 


CTCTGGTTTG 


ATTAAGGCCA 


ATGTTCCATC 


TCGTGTAGCA 


TTTGCGGTTT 


CATCAGGAAC 


1320 


AGACTCCCGT 


ACGATTTTGG 


ATGAAAATGG 


AGCAGAAAAA 


CTTCTTGGTC 


GAGGAGACAT 


1380 


GCTCTTTAAA 


CCGATTGATG 


AAAATCATCC 


AGTTCGTCTC 


CAAGGCTCCT 


TTATCTCGGA 


1440 
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TGACGATGTT GAGCGCATTG TGAACTTCAT CAAGACTCAG GCAGATGCAG ACTACGATGA 1500 

GAGTTTTGAT CCAGGTGAGG TTTCTGAAAA TGAAGGAGAA TTTTCGGATG GAGATGCTGG 1560 

TGGTGATCCG CTTTTTGAAG AAGCTAAGTC TTTGGTTATC GAAACACAGA AAGCCAGTGC 1620 

GTCTATGATT CAGCGTCGTT TATCAGTTGG ATTTAACCGT GCGACCCGTC TCATGGAAGA 1680 

ACTGGAGATA GCAGGTGTCA TCGGTCCAGC TGAAGGTACC AAACCTCGAA AAGTGTTACA 1740 

ACAA 1744 
(2) INFORMATION FOR SEQ ID NO:132: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 581 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 

Arg Gly Phe Ala Lys Trp Trp Glu Gly His Glu Arg Arg Lys Glu Glu 
15 10 15 

Arg Phe Val Lys Gin Glu Glu Lys Ala Arg Gin Lys Ala Glu Lys Glu 
20 25 30 

Ala Arg Leu Glu Gin Glu Glu Thr Glu Lys Ala Leu Leu Asp Leu Pro 
35 40 45 

Pro Val Asp Met Glu Thr Gly Glu lie Leu Thr Glu Glu Ala Val Gin 
50 55 60 

Asn Leu Pro Pro lie Pro Glu Glu Lys Trp Val Glu Pro Glu lie lie 
65 70 75 80 

Leu Pro Gin Ala Glu Leu Lys Phe Pro Glu Gin Glu Asp Asp Ser Asp 
85 90 95 

Asp Glu Asp Val Gin Val Asp Phe Ser Ala Lys Glu Ala Leu Glu Tyr 
100 105 110 

Lys Leu Pro Ser Leu Gin Leu Phe Ala Pro Asp Lys Pro Lys Asp Gin 
115 120 125 

Ser Lys Glu Lys Lys lie Val Arg Glu Asn lie Lys lie Leu Glu Ala 
130 135 140 

Thr Phe Ala Ser Phe Gly He Lys Val Thr Val Glu Arg Ala Glu He 
145 150 155 160 

Gly Pro Ser Val Thr Lys Tyr Glu Val Lys Pro Ala Val Gly Val Arg 
165 170 175 

Val Asn Arg He Ser Asn Leu Ser Asp Asp Leu Ala Leu Ala Leu Ala 
180 185 190 

Ala Lys Asp Val Arg He Glu Ala Pro He Pro Gly Lys Ser Leu He 
195 200 205 
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Gly lie Glu Val Pro Asn Ser Asp lie Ala Thr Val Ser Phe Arg Glu 
210 215 220 

Leu Trp Glu Gin Ser Gin Thr Lys Ala Glu Asn Phe Leu Glu lie Pro 
225 230 235 240 

Leu Gly Lys Ala Val Asn Gly Thr Ala Arg Ala Phe Asp Leu Ser Lys 
245 250 255 

Met Pro His Leu Leu Val Ala Gly Ser Thr Gly Ser Gly Lys Ser Val 
260 265 270 

Ala Val Asn Gly lie lie Ala Ser lie Leu Met Lys Ala Arg Pro Asp 
275 280 285 

Gin Val Lys Phe Met Met Val Asp Pro Lys Met Val Glu Leu Ser Val 
290 295 300 

Tyr Asn Asp lie Pro His Leu Leu lie Pro Val Val Thr Asn Pro Arg 
305 310 315 320 

Lys Ala Ser Lys Ala Leu Gin Lys Val Val Asp Glu Met Glu Asn Arg 
325 330 335 

Tyr Glu Leu Phe Ala Lys Val Gly Val Arg Asn lie Ala Gly Phe Asn 
340 345 350 

Ala Lys Val Glu Glu Phe Asn Ser Gin Ser Glu Tyr Lys Gin lie Pro 
355 360 365 

Leu Pro Phe lie Val Val lie Val Asp Glu Leu Ala Asp Leu Met Met 
370 375 380 

Val Ala Ser Lys Glu Val Glu Asp Ala lie lie Arg Leu Gly Gin Lys 
385 390 395 400 

Ala Arg Ala Ala Gly lie His Met lie Leu Ala Thr Gin Arg Pro Ser 
405 410 415 

Val Asp Val lie Ser Gly Leu lie Lys Ala Asn Val Pro Ser Arg Val 
420 425 430 

Ala Phe Ala Val Ser Ser Gly Thr Asp Ser Arg Thr lie Leu Asp Glu 
435 440 445 

Asn Gly Ala Glu Lys Leu Leu Gly Arg Gly Asp Met Leu Phe Lys Pro 
450 455 460 

lie Asp Glu Asn His Pro Val Arg Leu Gin Gly Ser Phe lie Ser Asp 
465 470 475 480 

Asp Asp Val Glu Arg lie Val Asn Phe lie Lys Thr Gin Ala Asp Ala 
485 490 495 

Asp Tyr Asp Glu Ser Phe Asp Pro Gly Glu Val Ser Glu Asn Glu Gly 
500 505 510 

Glu Phe Ser Asp Gly Asp Ala Gly Gly Asp Pro Leu Phe Glu Glu Ala 
515 520 525 

Lys Ser Leu Val lie Glu Thr Gin Lys Ala Ser Ala Ser Met lie Gin 
530 535 540 



136 



Arg Arg Leu Ser Val Gly Phe Asn Arg Ala Thr Arg Leu Met Glu Glu 
545 550 555 560 

Leu Glu lie Ala Gly Val lie Gly Pro Ala Glu Gly Thr Lys Pro Arg 
565 570 575 

Lys Val Leu Gin Gin 
580 

(2) INFORMATION FOR SEQ ID NO: 133: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 829 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133: 



TCAAAAAGAG 


AAGGAAAACT 


TGGTTATTGC 


TGGGAAAATA 


GGTCCAGAAC 


CAGAAATTTT 


60 


GGCCAATATG 


TATAAGTTGC 


TGATTGAAGA 


AAATACCAGC 


ATGACTGCGA 


CTGTTAAACC 


120 


GAATTTTGGG 


AAGACAAGCT 


TCCTTTATGA 


AGCTCTGAAA 


AAAGGCGATA 


TTGACATCTA 


180 


TCCTGAATTT 


ACTGGTACGG 


TGACTGAAAG 


TTTGCTTCAA 


CCATCACCCA 


AGGTGAGTCA 


240 


TGAACCAGAA 


CAGGTTTATC 


AGGTGGCGCG 


TGATGGCATT 


GCTAAGCAGG 


ATCATCTAGC 


300 


CTATCTCAAA 


CCCATGTCTT 


ATCAAAACAC 


CTATGCTGTA 


GCTGTTCCGA 


AAAAGATTGC 


360 


TCAAGAATAT 


GGCTTGAAGA 


CCATTTCAGA 


CTTGAAAAAA 


GTGGAAGGGC 


AGTTGAAGGC 


420 


AGGTTTTACA 


CTCGAGTTTA 


ACGACCGTGA 


AGATGGAAAT 


AAGGGCTTGC 


AATCAATGTA 


480 


TGGTCTCAAT 


CTCAATGTAG 


CGACCATTGA 


GCCAGCCCTT 


CGCTATCAGG 


CTATTCAGTC 


540 


AGGGGATATT 


CAAATCACGG 


ATGCCTATTC 


GACTGATGCG 


GAATTGGAGC 


GTTATGATTT 


600 


ACAGGTCTTG 


GAAGATGACA 


AGCAACTCTT 


CCCACCTTAT 


CAAGGGGCTC 


CACTCATGAA 


660 


AGAAGCTCTT 


CTCAAGAAAC 


ACCCAGAGTT 


GGAAAGAGTT 


CTTAATACAT 


TGGCTGGTAA 


720 


GATTACAGAA 


AGCCAGATGA 


GCCAGCTCAA 


CTACCAAGTC 


GGTGTTGAAG 


GCAAGTCAGC 


780 


AAAGCAAGTA 


GCCAAGGAGT 


TTCTCCAAGA 


ACAAGGTTTG 


TTGAAGAAA 




829 



(2) INFORMATION FOR SEQ ID NO: 134: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 276 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 



Gin Lys Glu Lys Glu. Asn Leu Val lie Ala Gly Lys lie Gly Pro Glu 
15 10 15 
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Pro Glu lie Leu Ala Asn Met Tyr Lys Leu Leu lie Glu Glu Asn Thr 
20 25 30 

Ser Met Thr Ala Thr Val Lys Pro Asn Phe Gly Lys Thr Ser Phe Leu 
35 40 45 

Tyr Glu Ala Leu Lys Lys • Gly Asp lie Asp lie Tyr Pro Glu Phe Thr 
50 55 60 

Gly Thr Val Thr Glu Ser Leu Leu Gin Pro Ser Pro Lys Val Ser His 
65 70 75 80 

Glu Pro Glu Gin Val Tyr Gin Val Ala Arg Asp Gly He Ala Lys Gin 
85 90 95 

Asp His Leu Ala Tyr Leu Lys Pro Met Ser Tyr Gin Asn Thr Tyr Ala 
100 105 110 

Val Ala Val Pro Lys Lys He Ala Gin Glu Tyr Gly Leu Lys Thr He 
115 120 125 

Ser Asp Leu Lys Lys Val Glu Gly Gin Leu Lys Ala Gly Phe Thr Leu 
130 135 140 

Glu Phe Asn Asp Arg Glu Asp Gly Asn Lys Gly Leu Gin Ser Met Tyr 
145 150 155 160 

Gly Leu Asri Leu Asn Val Ala Thr He Glu Pro Ala Leu Arg Tyr Gin 
165 170 175 

Ala He Gin Ser Gly Asp He Gin He Thr Asp Ala Tyr Ser Thr Asp 
180 185 190 

Ala Glu Leu Glu Arg Tyr Asp Leu Gin Val Leu Glu Asp Asp Lys Gin 
195 200 205 

Leu Phe Pro Pro Tyr Gin Gly Ala Pro Leu Met Lys Glu Ala Leu Leu 
210 215 220 

Lys Lys His Pro Glu Leu Glu Arg Val Leu Asn Thr Leu Ala Gly Lys 
225 230 235 240 

He Thr Glu Ser Gin Met Ser Gin Leu Asn Tyr Gin Val Gly Val Glu 
245 250 255 

Gly Lys Ser Ala Lys Gin Val Ala Lys Glu Phe Leu Gin Glu Gin Gly 
260 265 270 

Leu Leu Lys Lys 
275 

INFORMATION FOR SEQ ID NO: 135: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 712 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135: 
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ACGTTCTATT GAGGACCACT TTGATTCAAA CTTCGAATTG GAATATAACC TCAAAGAAAA 60 

AGGGAAAACA GATCTTTTGA AGCTAGTTGA TAAAACAACT GACATGCGTC TGCATTTTAT 120 

CCGCCAAACT CATCCACGCG GTCTCGGAGA TGCTGTTTTG CAAGCCAAGG CTTTCGTCGG 180 

AAATGAACCT TTTGTCGTTA TGCTTGGTGA TGACTTGATG GATATCACAG ACGAAAAGGC 240 

TGTTCCACTT ACCAAACAAC TCATGGATGA CTACGAGCGT ACCCACGCGT CTACTATCGC 300 

TGTCATGCCA GTCCCTCATG ACGAAGTATC TGCTTACGGG GTTATTGCTC CGCAAGGCGA 360 

AGGAAAAGAT GGTCTTTACA GTGTTGAAAC CTTTGTTGAA AAACCAGCTC CAGAGGACGC 420 

TCCTAGCGAC CTTGCTATTA TCGGACGCTA CCTCCTCACG CCTGAAATTT TTGAGATTCT 480 

CGAAAAGCAA GCTCCAGGTG CAGGAAATGA AATTCAGCTG ACAGATGCAA TCGACACCCT 540 

CAATAAAACA CAACGTGTAT TTGCTCGTGA GTTCAAAGGG GCTCGTTACG ATGTCGGAGA 600 

CAAGTTTGGC TTCATGAAAA CATCCATCGA CTACGCCCTC AAACACCCAC AAGTCAAAGA 660 

TGATTTGAAG AATTACCTCA TCCAACTTGG AAAAGAATTG ACTGAGAAGG AA 712 
(2) INFORMATION FOR SEQ ID NO: 136: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 237 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136: 

Arg Ser lie Glu Asp His Phe Asp Ser Asn Phe Glu Leu Glu Tyr Asn 
15 10 15 

Leu Lys Glu Lys Gly Lys Thr Asp Leu Leu Lys Leu Val Asp Lys Thr 
20 25 30 

Thr Asp Met Arg Leu His Phe lie Arg Gin Thr His Pro Arg Gly Leu 
35 40 45 

Gly Asp Ala Val Leu Gin Ala Lys Ala Phe Val Gly Asn Glu Pro Phe 
50 55 60 

Val Val Met Leu Gly Asp Asp Leu Met Asp lie Thr Asp Glu Lys Ala 
65 70 75 80 

Val Pro Leu Thr Lys Gin Leu Met Asp Asp Tyr Glu Arg Thr His Ala 
85 90 95 

Ser Thr lie Ala Val Met Pro Val Pro His Asp Glu Val Ser Ala Tyr 
100 105 110 

Gly Val He Ala Pro Gin Gly Glu Gly Lys Asp Gly Leu Tyr Ser Val 
115 120 125 

Glu Thr Phe Val Glu Lys Pro Ala Pro Glu Asp Ala Pro Ser Asp Leu 
130 135 140 



139 



Ala lie lie Gly Arg Tyr Leu Leu 
145 150 

Glu Lys Gin Ala Pro Gly Ala Gly 
165 

lie Asp Thr Leu Asn Lys Thr Gin 
180 

Gly Ala Arg Tyr Asp Val Gly Asp 
195 200 

lie Asp Tyr Ala Leu Lys His Pro 
210 215 

Tyr Leu lie Gin Leu Gly Lys Glu 
225 230 

(2) INFORMATION FOR SEQ ID NO: 137: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 502 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 

CGCTCAAAAT ACCAGAGGTG TTCAGCTAAT CGAGCACGTT TCTCCTCAAA TGTTGAAAGC 60 

CCAATTGGAG AGTGTCTTTT CTGATATTCC ACCTCAGGCT GTAAAAACTG GAATGTTGGC 120 

TACTACTGAA ATCATGGAAA TCATCCAACC CTATCTTAAA AAACTGGATT GTCCCTATGT 180 

CCTTGATCCT GTTATGGTTG CTACAAGTGG AGATGCCTTG ATTGACTCAA ATGCTAGAGA 240 

CTATCTCAAA ACAAACTTAC TACCTCTAGC AACTATTATT ACGCCAAATC TTCCTGAAGC 300 

AGAAGAGATT GTTGGTTTTT CAATCCATGA CCCCGAAGAC ATGCAGCGTG CTGGTCGCCT 360 

GATTTTAAAA GAATTTGGTC CTCAGTCTGT GGTTATCAAA GGCGGACATC TCAAAGGTGG 420 

TGCTAAAGAT TTCCTCTTTA CCAAGAATGA ACAATTTGTC TGGGAAAGCC CACGAATTCA 480 

AACCTGTCAC ACCCATGGTA CT 502 



(2) INFORMATION FOR SEQ ID NO: 138: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 167 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138: 

Ala Gin Asn Thr Arg Gly Val Gin Leu lie Glu His Val Ser Pro Gin 
15 10 15 



Thr Pro Glu He Phe. Glu He Leu 
155 160 

Asn Glu He Gin Leu Thr Asp Ala 
170 175 

Arg Val Phe Ala Arg Glu Phe Lys 
185 190 

Lys Phe Gly Phe Met Lys Thr Ser 
205 

Gin Val Lys Asp Asp Leu Lys Asn 
220 



Leu Thr Glu Lys Glu 
235 
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Met Leu Lys Ala Gin Leu Glu Ser Val Phe Ser Asp lie Pro Pro Gin 
20 25 30 

Ala Val Lys Thr Gly Met Leu Ala Thr Thr Glu lie Met Glu lie lie 
35 40 45 

Gin Pro Tyr Leu Lys Lys Leu Asp Cys Pro Tyr Val Leu Asp Pro Val 
50 55 60 

Met Val Ala Thr Ser Gly Asp Ala Leu lie Asp Ser Asn Ala Arg Asp 
65 70 75 80 

Tyr Leu Lys Thr Asn Leu Leu Pro Leu Ala Thr lie lie Thr Pro Asn 
85 90 95 

Leu Pro Glu Ala Glu Glu lie Val Gly Phe Ser lie His Asp Pro Glu 
100 105 110 

Asp Met Gin Arg Ala Gly Arg Leu lie Leu Lys Glu Phe Gly Pro Gin 
115 120 125 

Ser Val Val lie Lys Gly Gly His Leu Lys Gly Gly Ala Lys Asp Phe 
130 135 140 

Leu Phe Thr Lys Asn Glu Gin Phe Val Trp Glu Ser Pro Arg lie Gin 
145 150 155 160 

Thr Cys His Thr His Gly Thr 
165 

(2) INFORMATION FOR SEQ ID NO: 139: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 805 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139: 

AATTGTACAA TTAGAAAAAG ATAGCAAATC AGACAAAGAA CAAGTTGATA AACTATTTGA 60 

ATCATTTGAT GCATCTTCAG ATGAATCTAT TTCTAAATTA AAAGAACTAT CTGAAACTTC 120 

ACTTAAAACC GATGCAGGTA AAGACTATCT TAATAACAAA GTCAAAGAAT CATCTAAAGC 180 

AATTGTAGAT TTTCATTTGC AAAAAGGTTT GGCTTATGAT GTTAAAGATT CAGATGACAA 240 

ATTTAAAGAT AAAGCAACTC TTGAAACAAA TGTAAAAGAA ATTACAAAAC AAATTGATTT 3 00 

TATCAAAAAA GTTGATGAAA CTTTTAAACA AGAGAATTTG GAAGAAACTC TTAAATCTCT 360 

AAATGATCTT GTTGATAAAT ATCAAAAACA AATCGAACTT TTGAAGAAAG AAGAAGAAAA 420 

AGCTGCTGAA AAAGCTGCTG AAAAAGCAAA GGAATCTTCT AGTCAAAGTA ATTCTTCTGG 480 

TAGTGCTTCT AATGAGTCTT ATAATGGATC TTCCAATTCA AATGTAGATT ATAGTTCATC 540 

TGAACAAACT AATGGATATT CAAATAATTA TGGCGGTCAA GATTATTCTG GTTCAGGAGA 600 

TAGTTCAACA AATGGTGGAT CATCAGAACA ATATTCATCT AGCAATTCAA ACAGCGGAGC 660 
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AAATAATGTC TACAGATATA AAGGCACTGG TGCTGACGGC TATCAAAGAT ACTACTACAA 720 

AGATCATAAT AATGGAGATG TGTATGATGA CGATGGAAAT TACCTTGGGA ACTTTGGTGG 780 

CGGCATTGCA GAACCTAGTC AACGC 805 
(2) INFORMATION FOR SEQ ID NO: 140: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 268 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140: 

lie Val Gin Leu Glu Lys Asp Ser Lys Ser Asp Lys Glu Gin Val Asp 
15 10 15 

Lys Leu Phe Glu Ser Phe Asp Ala Ser Ser Asp Glu Ser lie Ser Lys 
20 25 30 

Leu Lys Glu Leu Ser Glu Thr Ser Leu Lys Thr Asp Ala Gly Lys Asp 
35 40 45 

Tyr Leu Asn Asn Lys Val Lys Glu Ser Ser Lys Ala lie Val Asp Phe 
50 55 60 

His Leu Gin Lys Gly Leu Ala Tyr Asp Val Lys Asp Ser Asp Asp Lys 
65 70 75 80 

Phe Lys Asp Lys Ala Thr Leu Glu Thr Asn Val Lys Glu lie Thr Lys 
85 90 95 

Gin lie Asp Phe lie Lys Lys Val Asp Glu Thr Phe Lys Gin Glu Asn 
100 105 110 

Leu Glu Glu Thr Leu Lys Ser Leu Asn Asp Leu Val Asp Lys Tyr Gin 
115 120 125 

Lys Gin lie Glu Leu Leu Lys Lys Glu Glu Glu Lys Ala Ala Glu Lys 
130 135 140 

Ala Ala Glu Lys Ala Lys Glu Ser Ser Ser Gin Ser Asn Ser Ser Gly 
145 150 155 160 

Ser Ala Ser Asn Glu Ser Tyr Asn Gly Ser Ser Asn Ser Asn Val Asp 
165 170 175 

Tyr Ser Ser Ser Glu Gin Thr Asn Gly Tyr Ser Asn Asn Tyr Gly Gly 
180 185 190 

Gin Asp Tyr Ser Gly Ser Gly Asp Ser Ser Thr Asn Gly Gly Ser Ser 
195 200 205 

Glu Gin Tyr Ser Ser Ser Asn Ser Asn Ser Gly Ala Asn Asn Val Tyr 
210 215 220 

Arg Tyr Lys Gly Thr Gly Ala Asp Gly Tyr Gin Arg Tyr Tyr Tyr Lys 
225 230 235 240 
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Asp His Asn Asn Gly Asp Val Tyr Asp Asp Asp Gly Asn Tyr Leu Gly 

245 250 255 - ^ 

Asn Phe Gly Gly Gly lie Ala Glu Pro Ser Gin Arg g 
260 265 f 

(2) INFORMATION FOR SEQ ID NO: 141: ^ 

(i) SEQUENCE CHARACTERISTICS: J 

(A) LENGTH: 211 base pairs - 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 

TCTGACCAAG CAAAAAGAAG CAGTCAATGA CAAAGGAAAA GCAGCTGTTG TTAAGGTGGT 60 

GGAAAGCCAG GCAGAACTTT ATAGCTTAGA AAAGAATGAA GATGCTAGCC TAAGAAAGTT 120 

ACAAGCAGAT GGACGCATCA CGGAAGAACA GGCTAAAGCT TATAAAGAAT ACAATGATAA 180 

AAATGGAGGA GCAAATCGTA AAGTCAATGA T 211 



(2) INFORMATION FOR SEQ ID NO: 142: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142: 

Leu Thr Lys Gin Lys Glu Ala Val Asn Asp Lys Gly Lys Ala Ala Val 
1 5 10 15 

Val Lys Val Val Glu Ser Gin Ala Glu Leu Tyr Ser Leu Glu Lys Asn 
20 25 30 

Glu Asp Ala Ser Leu Arg Lys Leu Gin Ala Asp Gly Arg lie Thr Glu 
35 40 45 

Glu Gin Ala Lys Ala Tyr Lys Glu Tyr Asn Asp Lys Asn Gly Gly Ala 
50 55 60 

Asn Arg Lys Val Asn Asp 
65 70 

(2) INFORMATION FOR SEQ ID NO: 143: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 331 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 

GTCCGGCTCT GTCCAGTCCA CTTTTTCAGC GGTAGAGGAA CAGATTTTCT TTATGGAGTT 60 

TGAAGAACTC TATCGGGAAA CCCAAAAACG CAGTGTAGCC AGTCAGCAAA AGACTAGTCT 120 

GAACTTAGAT GGGCAGACGC TTAGCAATGG CAGTCAAAAG TTGCCAGTCC CTAAAGGAAT 180 

TCAGGCCCCA TCAGGCCAAA GTATTACATT TGACCGAGCT GGGGGCAATT CGTCCCTGGC 240 

TAAGGTTGAA TTTCAGACCA GTAAAGGAGC GATTCGCTAT' CAATTATATC TAGGAAATGG 300 

AAAAATTAAA CGCATTAAGG AAACAAAAAA T 331 
(2) INFORMATION FOR SEQ ID NO: 144: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 110 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144: 

Ser Gly Ser Val Gin Ser Thr Phe Ser Ala Val Glu Glu Gin lie Phe 
15 10 15 

Phe Met Glu Phe Glu Glu Leu Tyr Arg Glu Thr Gin Lys Arg Ser Val 
20 25 30 

Ala Ser Gin Gin Lys Thr Ser Leu Asn Leu Asp Gly Gin Thr Leu Ser 
35 40 45 

Asn Gly Ser Gin Lys Leu Pro Val Pro Lys Gly lie Gin Ala Pro Ser 
50 55 60 

Gly Gin Ser lie Thr Phe Asp Arg Ala Gly Gly Asn Ser Ser Leu Ala 
65 70 75 80 

Lys Val Glu Phe Gin Thr Ser Lys Gly Ala lie Arg Tyr Gin Leu Tyr 
85 90 95 

Leu Gly Asn Gly Lys lie Lys Arg lie Lys Glu Thr Lys Asn 
100 105 110 

(2) INFORMATION FOR SEQ ID NO: 145: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 196 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145: 
GGGACAAATT CAAAAAAATA GGCAAGAGGA AGCAAAAATC TTGCAAAAGG AAGAAGTCTT 
GAGGGTAGCT AAGATGGCCC TGCAGACGGG GCAAAATCAG GTAAGCATCA ACGGAGTTGA 



60 
120 
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GATTCAGGTA TTTTCTAGTG AAAAAGGATT GGAGGTCTAC CATGGTTCAG AACAGTTGTT 180 
GGCAATCAAA GAGCCA 196 
(2) INFORMATION FOR SEQ ID NO: 146: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146: 

Gly Gin lie Gin Lys Asn Arg Gin Glu Glu Ala Lys lie Leu Gin Lys 
1 5 10 . 15 

Glu Glu Val Leu Arg Val Ala Lys Met Ala Leu Gin Thr Gly Gin Asn 
20 25 30 

Gin Val Ser He Asn Gly Val Glu He Gin Val Phe Ser Ser Glu Lys 
35 40 45 

Gly Leu Glu Val Tyr His Gly Ser Glu Gin Leu Leu Ala He Lys Glu 
50 55 60 

Pro 
65 

(2) INFORMATION FOR SEQ ID NO: 147: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 319 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147: 

TCGCTACCAG CAACAAAGCG AGCAAAAGGA GTGGCTCTTG TTTGTGGACC AACTTGAGGT 60 

AGAATTAGAC CGTTCGCAGT TCGAAAAAGT AGAAGGCAAT CGCCTATACA TGAAGCAAGA 120 

TGGCAAGGAC ATCGCCATCG GTAAGTCAAA GTCAGATGAT TTCCGTAAAA CGAATGCTCG 180 

TGGTCGAGGT TATCAGCCTA TGGTTTATGG ACTCAAATCT GTACGGATTA CAGAGGACAA 240 

TCAACTGGTT CGCTTTCATT TCCAGTTCCA AAAAGGCTTA GAAAGGGAGT TCATCTATCG 300 

TGTGGAAAAA GAAAAAAGT 319 
(2) INFORMATION FOR SEQ ID NO: 148: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148: 

Arg Tyr Gin Gin Gin Ser Glu Gin Lys Glu Trp Leu Leu Phe Val Asp 
15 10 15 

Gin Leu Glu Val Glu Leu Asp Arg Ser Gin Phe Glu Lys Val Glu Gly 
20 25 30 

Asn Arg Leu Tyr Met Lys Gin Asp Gly Lys Asp lie Ala lie Gly Lys 
35 40 45 

Ser Lys Ser Asp Asp Phe Arg Lys Thr Asn Ala Arg Gly Arg Gly Tyr 
50 55 60 

Gin Pro Met Val Tyr Gly Leu Lys Ser Val Arg lie Thr Glu Asp Asn 
65 70 75 80 

Gin Leu Val Arg Phe His Phe Gin Phe Gin Lys Gly Leu Glu Arg Glu 
85 90 95 

Phe lie Tyr Arg Val Glu Lys Glu Lys Ser 
100 105 

(2) INFORMATION FOR SEQ ID NO: 149: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 322 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149: 

GAACCGACAA GTCGCCCACT ATCAAGACTA TGCTTTGAAT AAAGAAAAAT TGGTTGCTTT 60 

TGCTATGGCT AAACGAACCA AAGATAAGGT TGAGCAAGAA AGTGGGGAAC AGTTTTTTAA 120 

TCTAGGTCAG GTAAGCTATC AAAACAAGAA AACTGGCTTA GTGACGAGGG TTCGTACGGA 180 

TAAGAGCCAA TATGAGTTTC TGTTTCCTTC AGTCAAAATC AAAGAAGAGA AAAGAGATAA 240 

AAAGGAAGAG GTAGCGACCG ATTCAAGCGA AAAAGTGGAG AAGAAAAAAT CAGAAGAGAA 300 

GCCTGAAAAG AAAGAGAATT CA 322 
(2) INFORMATION FOR SEQ ID NO: 150: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 

Asn Arg Gin Val Ala His Tyr Gin Asp Tyr Ala Leu Asn Lys Glu Lys 
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1 5 10 15 

Leu Val Ala Phe Ala Met Ala Lys Arg Thr Lys Asp Lys Val Glu Gin 
20 25 30 

Glu Ser Gly Glu Gin Phe Phe Asn Leu Gly Gin Val Ser Tyr Gin Asn 
35 40 45 

Lys Lys Thr Gly Leu Val Thr Arg Val Arg Thr Asp Lys Ser Gin Tyr 
50 55 60 

Glu Phe Leu Phe Pro Ser Val Lys lie Lys Glu Glu Lys Arg Asp Lys 
65 70 75 80 

Lys Glu Glu Val Ala Thr Asp Ser Ser Glu Lys Val Glu Lys Lys Lys 
85 90 95 

Ser Glu Glu Lys Pro Glu Lys Lys Glu Asn Ser 
100 105 

(2) INFORMATION FOR SEQ ID NO: 151: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 784 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151: 

GGTTGTCGGC TGGCAATATA TCCCGTTTCC ATCTAAAGGT AGTACAATTG GTCCTTACCC 60 

AAATGGTATC AGATTAGAAG GTTTTCCAAA GTCAGAGTGG TACTACTTCG ATAAAAATGG 120 

AGTGCTACAA GAGTTTGTTG GTTGGAAAAC ATTAGAGATT AAAACTAAAG ACAGTGTTGG 180 

AAGAAAGTAC GGGGAAAAAC GTGAAGATTC AGAAGATAAA GAAGAGAAGC GTTATTATAC 240 

GAACTATTAC TTTAATCAAA ATCATTCTTT AGAGACAGGT TGGCTTTATG ATCAGTCTAA 300 

CTGGTATTAT , CTAGCTAAGA CGGAAATTAA TGGAGAAAAC TACCTTGGTG GTGAAAGACG 360 

TGCGGGGTGG ATAAACGATG ATTCGACTTG GTACTACCTA GATCCAACAA CTGGTATTAT 420 

GCAAACAGGT TGGCAATATC TAGGTAATAA GTGGTACTAC CTCCGTTCCT CAGGAGCAAT 480 

GGCCACTGGC TGGTATCAGG AAGGTACCAC TTGGTATTAT TTAGACCACC CAAATGGCGA 540 

TATGAAAACA GGTTGGCAAA ACCTTGGGAA CAAATGGTAC TATCTCCGTT CATCAGGAGC 600 

TATGGCAACT GGTTGGTATC AAGATGGTTC AACTTGGTAC TACCTAAATG CAGGTAATGG 660 

AGACATGAAG ACAGGTTGGT TCCAGGTCAA TGGCAACTGG TACTATGCTT ATAGCTCAGG 720 

TGCTTTGGCA GTGAATACGA CCGTAGATGG CTATTCTGTC AACTATAATG GCGAATGGGT 780 

TCGG 784 

(2) INFORMATION FOR SEQ ID NO: 152: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 261 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 

Val Val Gly Trp Gin Tyr lie Pro Phe Pro Ser Lys Gly Ser Thr lie 
15 10 15 

Gly Pro Tyr Pro Asn Gly lie Arg Leu Glu Gly Phe Pro Lys Ser Glu 
20 25 30 

Trp Tyr Tyr Phe Asp Lys Asn Gly Val Leu Gin Glu Phe Val Gly Trp 
35 40 45 

Lys Thr Leu Glu lie Lys Thr Lys Asp Ser Val Gly Arg Lys Tyr Gly 
50 55 60 

Glu Lys Arg Glu Asp Ser Glu Asp Lys Glu Glu Lys Arg Tyr Tyr Thr 
65 70 75 80 

Asn Tyr Tyr Phe Asn Gin Asn His Ser Leu Glu Thr Gly Trp Leu Tyr 
85 90 95 

Asp Gin Ser Asn Trp Tyr Tyr Leu Ala Lys Thr Glu lie Asn Gly Glu 
100 105 110 

Asn Tyr Leu Gly Gly Glu Arg Arg Ala Gly Trp lie Asn Asp Asp Ser 
115 120 125 

Thr Trp Tyr Tyr Leu Asp Pro Thr Thr Gly lie Met Gin Thr Gly Trp 
130 135 140 

Gin Tyr Leu Gly Asn Lys Trp Tyr Tyr Leu Arg Ser Ser Gly Ala Met 
145 150 155 160 

Ala Thr Gly Trp Tyr Gin Glu Gly Thr Thr Trp Tyr Tyr Leu Asp His 
165 170 175 

Pro Asn Gly Asp Met Lys Thr Gly Trp Gin Asn Leu Gly Asn Lys Trp 
180 185 190 

Tyr Tyr Leu Arg Ser Ser Gly Ala Met Ala Thr Gly Trp Tyr Gin Asp 
195 200 205 

Gly Ser Thr Trp Tyr Tyr Leu Asn Ala Gly Asn Gly Asp Met Lys Thr 
210 215 220 

Gly Trp Phe Gin Val Asn Gly Asn Trp Tyr Tyr Ala Tyr Ser Ser Gly 
225 230 235 240 

Ala Leu Ala Val Asn Thr Thr Val Asp Gly Tyr Ser Val Asn Tyr Asn 
245 250 255 

Gly Glu Trp Val Arg 
260 

INFORMATION FOR SEQ ID NO: 153: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1708 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153: 



GGCCAAATCA 


GAATGGGTAG 


AAGACAAGGG 


AGC C TTTT AT 


mTV m/^mm/^ aO/^ 


A A 0 A mo 0 A A A 

AAGA i GGAAA 


D u 


GATGAAAAGA 


AATGCTTGGG 


T AGG AAC TTL 




(jL AAC AvjCj 1 


OOA A AOmA Am 

L-LAAAGi AAi 


ion 


AGAAGACTGG 


GTCTATGATT 


^^m/**i TV TV m TV /"^ TV 

CTCAATACGA 


TGCTTGGTTT 


m A m A m/^ "a "a "a O 
1 A 1 A 1 C AAACj 


0 A 0 A moo A 0 A 

GAGAl GGALA 


loU 


GCACGCAGAG 


AAAGAATGGC 


TCCAAATTAA 


AGGGAAGGAC 


TT\ TV mm TV mm m/^ tv 

TATTATTTCA 


AA IL G GGTGG 


0 /I n 


TTATCTACTG 


ACAAGTCAGT 


TV mm TV TV TT\ TV 

GGATTAATCA 


ACjCT TA 1 Cj i (j 


AA 1 CjL. i Ava i vjf 


omoooA A Aom 
G i GGLAAAG i 


inn 
0 U U 


ACAGCAAGGT 


TGGCTTTTTG 


TV TV TV TV TV TV mTV 

ACAAACAATA 


CLAAIC J. ICjCj 


mrnmT' a a rp/^ A 

i i i lAv^AiL-A 


A A 0 A A A A moo 

AAGAAAA 1 GG 


J u U 


AAACTATGCT 


GATAAAGAAT 


/~i TV mmmm/^ Tv 

GG ATTTTC G A 


CjAA i CjCj i L AC 


m A mm a rnrp tv m/"* 

i Ai i Ai 1 AiL. 


rpA A A AmOOOO 

i AAAAi GGGG 


/ion 


TGGCTACATG 


GCAGCCAATG 


TV TV m/*^/^ TV mmm/^ 

AATGGATTTG 


CjCjA i AAGCjAA 


rn/"' rnrp/"' rnrnrnrn 

i L i 1 J. i i i 


A nr>omo a a a mT* 
A i G i G AAA i i 


/ion 


TGATGGGAAA 


ATGGC TGAAA 


TV Ti /"< TV TV m/^/^/^m 

AAGAATGGGT 




r^AmA/^nn/*^ A A 
LA i Atj i L,AALt 


OT*'T'00'T' A OT* A 

L. i I GLi X AU i A 


D4i u 


CTTCAAATCC 


GGTGGTTACA 


m/~» TV TV r>r^f^ 7\ 71 

TG AC AGC C AA 


m/^ "A TV m/^/^ TV mm 

TGAATGGATT 


m/^/^/^AmA A/^/^ 


AAi G 1 IGG i 1 


c n n 

DUU 


TTATCTCAAA 


TCTGATGGGA 


TV TV TV m TV m ^ 

AAATAGCTGA 


■A 'A "A A "A 'A mo 0 

AAAAGAATGG 


y^m/^m A /^/^ A mm 


omo A m A omo a 
G i G A 1 AG i G A 


c n 
D 0 u 


AGCTTGGTAC 


TACTTCAAAT 


CCGGTGGTTA 


C ATGAC AGC C 


A A mo A A moo A 

AATGAATGGA 


mmmooo a m a a 
1 i J. GGGA i AA 


Ton 
/ z U 


GGAATCTTGG 


TTTTACCTCA 


Tfc Tfc m^*tm/^ TV m^^^^ 

AATCTGATGG 


/"» TV TV TV TV m TV /"l/^IXI 

GAAAATAGCT 


OAAAAAOAA m 

GAAAAAGAAT 


ooomom A oo a 
GGGTG TAG GA 


"7 Q n 
/oU 


TTCTCATAGT 


CAAGCTTGGT 


TV ^^mTv mm/~i tv tv 

ACTACTTCAA 


ATCTGGTGGC 


m A 0 A mo 000 A 

T AC ATGGC GA 


A A A A mo A 0 A 0 

AAAATGAGAC 


0 /I n 


AGTAGATGGT 


TATCAGCTTG 


GAAGCGATGG 


mTV TV TV m/^/^/^mm 

TAAATGGCTT 


TV 00 TV TV TV TV TV 

GGAGGAAAAA 


om A 0 A A A mo A 

C TAC AAATGA 


Q n n 


AAATGCTGCT 


TACTATCAAG 


TAGTGCCTGT 


m TV /~1 TV /-^ y*^ TV TV m 

TACAGCCAAT 


GTTTATGATT 


0 A 0 A mo 0 mo A 

CAGATGGTGA 


D c n 


AAAGCTTTCC 


TATATATCGC 


AAGGTAGTGT 


CGTATGGCTA 


OAmA AOOA m A 

GATAAGGATA 


0 A A A A A 0 mo A 

G AAAAAG x G A 


1 no n 
1 Uz U 


TGACAAGCGC 


TTGGCTATTA 


CTATTTCTGG 


TTTGTCAGGC 


m A m TV mO A A A A 

TATATGAAAA 


0 A 0 A A 0 A mmm 
GAGAAGAX 1 1 


1 AO n 


ACAAGCGCTA 


GATGCTAGTA 


TV TV y^mmrriTV m 

AGGAC TTT AT 


CCCTTATTAT 


0 A 0 A omo A mo 

GAGAGTGATG 


000 A ooommm 
GGGAGGGi i I 


1 1 /I n 


TTATCACTAT 


GTGGCTCAGA 




/~i /~i A om7\ /^om 
CLCACj 1 A(jjL 1 


momo A rnOTiTiTi 


OT^O ATA moo A 

G i GA i A 1 GGA 


1 onn 
xz u u 


AGTAGGCAAG 


AAATATTATT 


CGGCAGATGG 


CCTGCATTTT 


GATGGTTTTA 


AGCTTGAGAA 


1260 


TCCCTTCCTT 


TTCAAAGATT 


TAACAGAGGC 


TACAAACTAC 


AGTGCTGAAG 


AATTGGATAA 


1320 


GGTATTTAGT 


TTGCTAAACA 


TTAACAATAG 


CCTTTTGGAG 


AACAAGGGCG 


CTACTTTTAA 


1380 


GGAAGCCGAA 


GAACATTACC 


ATATCAATGC 


TCTTTATCTC 


CTTGCCCATA 


GTGCCCTAGA 


1440 


AAGTAACTGG 


GGAAGAAGTA 


AAATTGCCAA 


AGATAAGAAT 


AATTTCTTTG 


GCATTACAGC 


1500 


CTATGATACG 


ACCCCTTACC 


TTTCTGCTAA 


GACATTTGAT 


GATGTGGATA 


AGGGAATTTT 


1560 


AGGTGCAACC 


AAGTGGATTA 


AGGAAAATTA 


TATCGATAGG 


GGAAGAACTT 


TCCTTGGAAA 


1620 


CAAGGCTTCT 


GGTATGAATG 


TGGAATATGC 


TTCAGACCCT 


TATTGGGGCG 


AAAAAATTGC 


1680 
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TAGTGTGATG ATGAAAATCA ATGAGAAG 
(2) INFORMATION FOR SEQ ID NO: 154: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 569 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154: 

Ala Lys Ser Glu Trp Val Glu Asp Lys Gly Ala Phe Tyr Tyr Leu Asp 
1 5 10 15 

Gin Asp Gly Lys Met Lys Arg Asn Ala Trp Val Gly Thr Ser Tyr Val 
20 25 30 

Gly Ala Thr Gly Ala Lys Val lie Glu Asp Trp Val Tyr Asp Ser Gin 
35 40 45 

Tyr Asp Ala Trp Phe Tyr lie Lys Ala Asp Gly Gin His Ala Glu Lys 
50 55 60 

Glu Trp Leu Gin lie Lys Gly Lys Asp Tyr Tyr Phe Lys Ser Gly Gly 
65 70 75 80 

Tyr Leu Leu Thr Ser Gin Trp lie Asn Gin Ala Tyr Val Asn Ala Ser 
85 90 95 

Gly Ala Lys Val Gin Gin Gly Trp Leu Phe Asp Lys Gin Tyr Gin Ser 
100 105 110 

Trp Phe Tyr lie Lys Glu Asn Gly Asn Tyr Ala Asp Lys Glu Trp lie 
115 120 125 

Phe Glu Asn Gly His Tyr Tyr Tyr Leu Lys Ser Gly Gly Tyr Met Ala 
130 135 140 

Ala Asn Glu Trp lie Trp Asp Lys Glu Ser Trp Phe Tyr Leu Lys Phe 
145 150 155 160 

Asp Gly Lys Met Ala Glu Lys Glu Trp Val Tyr Asp Ser His Ser Gin 
165 170 175 

Ala Trp Tyr Tyr Phe Lys Ser Gly Gly Tyr Met Thr Ala Asn Glu Trp 
180 185 190 

lie Trp Asp Lys Glu Ser Trp Phe Tyr Leu Lys Ser Asp Gly Lys lie 
195 200 205 

Ala Glu Lys Glu Trp Val Tyr Asp Ser His Ser Gin Ala Trp Tyr Tyr 
210 215 220 

Phe Lys Ser Gly Gly Tyr Met Thr Ala Asn Glu Trp lie Trp Asp Lys 
225 230 235 240 

Glu Ser Trp Phe Tyr Leu Lys Ser Asp Gly Lys lie Ala Glu Lys Glu 
245 250 255 

Trp Val Tyr Asp Ser His Ser Gin Ala Trp Tyr Tyr Phe Lys Ser Gly 



150 



260 



265 



270 



Gly 



Tyr Met Ala Lys Asn 
275 



Glu Thr Val Asp Gly 
280 



Tyr Gin Leu Gly Ser 
285 



Asp Gly Lys Trp Leu Gly Gly Lys Thr Thr Asn Glu Asn Ala Ala Tyr 
290 295 300 

Tyr Gin Val Val Pro Val Thr Ala Asn Val Tyr Asp Ser Asp Gly Glu 
305 310 315 320 

Lys Leu Ser Tyr lie Ser Gin Gly Ser Val Val Trp Leu Asp Lys Asp 
325 330 335 

Arg Lys Ser Asp Asp Lys Arg Leu Ala lie Thr lie Ser Gly Leu Ser 
340 345 350 

Gly Tyr Met Lys Thr Glu Asp Leu Gin Ala Leu Asp Ala Ser Lys Asp 
355 360 365 

Phe lie Pro Tyr Ty^ Glu Ser Asp Gly His Arg Phe Tyr His Tyr Val 
370 ' 375 380 

Ala Gin Asn Ala Ser lie Pro Val Ala Ser His Leu Ser Asp Met Glu 
385 390 395 400 

Val Gly Lys Lys Tyr Tyr Ser Ala Asp Gly Leu His Phe Asp Gly Phe 
405 410 415 

Lys Leu Glu Asn Pro Phe Leu Phe Lys Asp Leu Thr Glu Ala Thr Asn 
420 425 430 

Tyr Ser Ala Glu Glu Leu Asp Lys Val Phe Ser Leu Leu Asn lie Asn 
435 440 445 

Asn Ser Leu Leu Glu Asn Lys Gly Ala Thr Phe Lys Glu Ala Glu Glu 
450 455 460 

His Tyr His lie Asn Ala Leu Tyr Leu Leu Ala His Ser Ala Leu Glu 
465 470 475 480 

Ser Asn Trp Gly Arg Ser Lys lie Ala Lys Asp Lys Asn Asn Phe Phe 
485 490 495 

Gly lie Thr Ala Tyr Asp Thr Thr Pro Tyr Leu Ser Ala Lys Thr Phe 
500 505 510 

Asp Asp Val Asp Lys Gly lie Leu Gly Ala Thr Lys Trp lie Lys Glu 
515 520 525 

Asn Tyr lie Asp Arg Gly Arg Thr Phe Leu Gly Asn Lys Ala Ser Gly 
530 535 540 

Met Asn Val Glu Tyr Ala Ser Asp Pro Tyr Trp Gly Glu Lys lie Ala 
545 550 555 560 

Ser Val Met Met Lys lie Asn Glu Lys 
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INFORMATION FOR SEQ ID NO: 155: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 946 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155: 



ATTTGCAGAT 


GATTCTGAAG 


GATGGCAGTT 


TGTCCAAGAA 


AATGGTAGAA 


CCTACTACAA 


60 


AAAGGGGGAT 


CTAAAAGAAA 


CCTACTGGAG 


AGTGATAGAT 


GGGAAGTACT 


ATTATTTTGA 


120 


TCCTTTATCC 


GGAGAGATGG 


TTGTCGGCTG 


GCAATATATA 


CCTGCTCCAC 


ACAAGGGGGT 


180 


TACGATTGGT 


CCTTCTCCAA 


GAATAGAGAT 


TGCTCTTAGA 


CCAGATTGGT 


TTTATTTTGG 


240 


TCAAGATGGT 


GTATTACAAG 


AATTTGTTGG 


CAAGCAAGTT 


TTAGAAGCAA 


AAACTGCTAC 


300 


GAATACCAAC 


AAACATCATG 


GGGAAGAATA 


TGATAGCCAA 


GCAGAGAAAC 


GAGTCTATTA 


360 


TTTTGAAGAT 


CAGCGTAGTT 


ATCATACTTT 


AAAAACTGGT 


TGGATTTATG 


AAGAGGGTCA 


420 


TTGGTATTAT 


TTACAGAAGG 


ATGGTGGCTT 


TGATTCGCGC 


ATCAACAGAT 


TGACGGTTGG 


480 


AGAGCTAGCA 


CGTGGTTGGG 


TTAAGGATTA 


CCCTCTTACG 


TATGATGAAG 


AGAAGCTAAA 


540 


AGCAGCTCCA 


TGGTACTATC 


TAAATCCAGC 


AACTGGCATT 


ATGCAAACAG 


GTTGGCAATA 


600 


TCTAGGTAAT 


AGATGGTACT 


ACCTCCATTC 


GTCAGGAGCT 


ATGGCAACTG 


GCTGGTATAA 


660 


GGAAGGCTCA 


ACTTGGTACT 


ATCTAGATGC 


TGAAAATGGT 


GATATGAGAA 


CTGGCTGGCA 


720 


AAACCTTGGG 


AACAAATGGT 


ACTATCTCCG 


TTCATCAGGA 


GCTATGGCAA 


CTGGTTGGTA 


780 


TCAGGAAAGT 


TCGACTTGGT 


ACTATCTAAA 


TGCAAGTAAT 


GGAGATATGA 


AAACAGGCTG 


840 


GTTCCAAGTC 


AATGGTAACT 


GGTACTATGC 


CTATGATTCA 


GGTGCTTTAG 


CTGTTAATAC 


900 


CACAGTAGGT 


GGTTACTACT 


TAAACTATAA 


TGGTGAATGG 


GTTAAG 
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(2) INFORMATION FOR SEQ ID NO: 156: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 316 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156: 

Val Phe Ala Asp Asp Ser Glu Gly Trp Gin Phe Val Gin Glu Asn Gly 
15 10 15 

Arg Thr Tyr Tyr Lys Lys Gly Asp Leu Lys Glu Thr Tyr Trp Arg Val 
20 25 30 

lie Asp Gly Lys Tyr Tyr Tyr Phe Asp Pro Leu Ser Gly Glu Met Val 
35 40 45 

Val Gly Trp Gin Tyr lie Pro Ala Pro His Lys Gly Val Thr lie Gly 
50 55 60 
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Pro Ser Pro Arg lie Glu lie Ala Leu Arg Pro Asp Trp Phe Tyr Phe / . ^ 

65 70 75 80 ^ 

Gly Gin Asp Gly Val Leu Gin Glu Phe Val Gly Lys Gin Val Leu Glu 

85 90 95 f 

Ala Lys Thr Ala Thr Asn Thr Asn Lys His His Gly Glu Glu Tyr Asp 

100 105 110 I 

Ser Gin Ala Glu Lys Arg Val Tyr Tyr Phe Glu Asp Gin Arg Ser Tyr 
115 120 125 

His Thr Leu Lys Thr Gly Trp lie Tyr Glu Glu Gly His Trp Tyr Tyr 
130 135 140 

Leu Gin Lys Asp Gly Gly Phe Asp Ser Arg lie Asn Arg Leu Thr Val 
145 150 155 160 

Gly Glu Leu Ala Arg Gly Trp Val Lys Asp Tyr Pro Leu Thr Tyr Asp 
165 170 175 

Glu Glu Lys Leu Lys Ala Ala Pro Trp Tyr Tyr Leu Asn Pro Ala Thr 
180 185 190 

Gly lie Met Gin Thr Gly Trp Gin Tyr Leu Gly Asn Arg Trp Tyr Tyr 
195 200 205 

Leu His Ser Ser Gly Ala Met Ala Thr Gly Trp Tyr Lys Glu Gly Ser 
210 215 220 

Thr Trp Tyr Tyr Leu Asp Ala Glu Asn Gly Asp Met Arg Thr Gly Trp 
225 230 235 240 

Gin Asn Leu Gly Asn Lys Trp Tyr Tyr Leu Arg Ser Ser Gly Ala Met 
245- 250 255 

Ala Thr Gly Trp Tyr Gin Glu Ser Ser Thr Trp Tyr Tyr Leu Asn Ala 
260 265 270 

Ser Asn Gly Asp Met Lys Thr Gly Trp Phe Gin Val Asn Gly Asn Trp 
275 280 285 

Tyr Tyr Ala Tyr Asp Ser Gly Ala Leu Ala Val Asn Thr Thr Val Gly 
290 295 300 

Gly Tyr Tyr Leu Asn Tyr Asn Gly Glu Trp Val Lys 
305 310 315 

(2) INFORMATION FOR SEQ ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1415 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157: 
TGTCGCTGCA AATGAAACTG AAGTAGCAAA AACTTCGCAG GATACAACGA CAGCTTCAAG 
TAGTTCAGAG CAAAATCAGT CTTCTAATAA AACGCAAACG AGCGCAGAAG TACAGACTAA 



60 
120 
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TV t^rnf~*r^f^ TV rn/^ 

L AL 1 CjCjIjA i Lj 


r^r^r*f\ tit> a nfT* a 
GGGAJ. i A i iA 


i i AAA^jVj A i 


O A TPPTTT'T' A 

Lja i LtLt i i l i a 


'A A POTO A A A O 
AALiL i L AAAlj 


1 RPI 

JL o U 




rnrnrn/~» TV A TV P'T' 

i 1 J. (jAL AAL 1 


AOTIATIA 7^f~*r^C* 

AL i A 1 AAtjCjL 


rprpo O rpmrprp TV m 
i iVjVji i i iAi 


A "yT A A TTT' A P 
A i i AA i J. L ALj 


TV rpppmpprprn A 
A i i Lij i i A 


Z fi u 




UAA 1 vjUL- A 1 


A A A T^T* A r^T* A 

IjAAA i i AL. i A 


r^PTT* A A ATT* A 


p rp p p TV m A TI A 


^nnnnnis. a a a 

ILiLfLLLAAAA 


J u U 




i A i CjAU AVjT i A 


A TIT* A A A ^ A 

Ai iAGAALiAtj 


mrpOOTT'T'T A T 
1 itjlji 1 i iAi 


OnPP A APTT' AP 
L i L AA(j i L- A\a 


ArnpppppmmTv 

A i LfLjLfoL i i a 


J D u 


TCjL 1 LA i L AA 


/"•A Arrt/^/^/^A ATI 

CjAA 1 tjtaL. AA i 


TO A T1T*00 A A A 
iGAl iOCjAAA 


T>A AOT'OOTiAO 
i AALi i \j\3 i AL 


'yAOT'TT' A AP A 
i AL i i L. AA^^A 


A pmpppprprpTV 

ALi i VjCLfLt i iA 


/ton 
^ z u 


CATGGCTAAA 


AGC C AATGGC 


A TV r^r^ TV A OTITI A 

AAGGAAGi lA 


rprpTlOTTlO A A Tl 

i i i L. i i G AA i 


OOTiO A A OO A O 

LrG i G AAGGALi 


OTI A TO A TOO A 

L i A i LA i GL A 


yi Q n 


AAATGAATGG 


/~i rn c* fy m tv ti/^ a rn 

CTbLTA i CjA 1 


CGAGLL i AI i 


OTiOOTiTi A TITIT 

L. i GG i i A i i i 


TTATOTIA AAA 

i i AiL i AAAA 


mOOOATiOOA A 

i LLGAi LGAA 




CTTATGCTAA 


OO A A A OTt/^/^ 

CCAAGAGTGG 


/"t TV TV TV TV TV /^TlOO 

L AAAAAG 1 GG 


GGGGG AAA i G 


OTI A OTI A TITITO 

G i AL iAi i i L 


A A o A A r^n^f^r^r^ 
AALAAL i GGG 


O UU 


GC TAT ATGGC 


TCGGAATGAG 


TGGLAAGGGA 


A OTI A OT> A TlTlTl 

AG i AG i A i i i 


OAOTIOOA AOT* 

GAG i GGAAG i 


O O TO O O A T>0 O 

LG i GLL A i GG 


D OU 


CGACTGAC GA 


TV /~*m/~^ TV mm tv rn/^ 

AGTGATTATG 


TV m/~</^rPTV /^rp/™' 

G ATGGT AC TC 


GG iAi AiG i i 


rpOOOOOOTiOTi 

iGGGGLLiLi 


OOTO A 00T>0 A 

GG i GAGL i LA 


Ton 


TV TV TV TV 71 TV TV TV TV 

AAGAAAAAAA 


TV TV mrprp/^ A A m 

AGATTTGAA 1 


GTLGGG i GGG 


n>'l»0 A O A O A O A 

i iCALALiALiA 


TiOOT A A O OOO 

i LiG i AALL GL 


TI A T*T1T*0 rpiivp A 

iAi i iL i i i A 


"7 Q n 
/ o U 


TV m TV TV m TV /"l 7V TV 

ATAATAGAGA 


A A A O A A OTIO 

AGAAL AAG i G 


OO AAOOOA A/"* 

GG AAG L G AAL 


A TIO OTA AOA A 

A i GG i AALjAA 


A O TIO A T>TiO A T* 

ACiLAi iLAi 


A TiT< A 0T*0 A OO 

Ai iALiGALL 


Q /I n 


TV TV TV m/^/*tm/^/~< 

AC AATGGTC G 


m TV m/^ TV TV T(/^ TV m 

TATCAATGAT 


TOO A AAA A OO 

1 GGAAAAAGG 


mrp ft rpTiO A TiO A 

i iAi iGAiGA 


OAAOOA AO TIO 

GAAL GAAG i G 


O A TiOOT*OTiO A 

LA i LG i G i LA 


Q n n 
y u u 


TTGTTCGTCT 


A rPT* A T* A 

AGGTl A 1 AGC 


OOTA A AOA AO 

GG 1 AAAGAAG 


A O A A OO A A TT 

AGAALiGAAi i 


OOOOO ATI A AO 

GGGGLAi AAL 


A T^Ti A A O O A O T 

Ai iAALLALi 


Q ^^n 
you 


m TV TV TV /^/^/^m/^m 

TAAACCGTCT 


r~*r~^ tv atiti/^oti 


i AiGGiGiG i 


A T^OTIOT A TIA O 

AiG iG iAi AC 


OTA TOOmO A A 

L i A i LtL i L AA 


A A TO A O A OOO 

AA i LALALLLf 


1 n 0 n 
± u z u 


ATCjCTCjAGAG 


TlO A OOOTI AAA 

1 GAUGL i AAA 


O A O A O O A TIT>0 

GAGAGLAi iG 


A A OTinr>A TI A A A 

AAL i i A i AAA 


OAAAT^AOAATi 

LAAA i AL AA i 


A i AAL L i Vj i 


1 n p n 
J. u ou 


CTTACCCTAT 


LTATTA 1 GA 1 


OTITIO A O A A TITI 

Gl IGAGAAl i 


OOO A A T> A rpOT 

GGGAA i A i G i 


A A ATA AOAOO 

AAA i AAGALjL 


A A O A O A OOTIO 

AALjALjALL i L 


1 1 /I n 


TV T\ /^m^ TV rmv 

CAAGTGATAC 


A r~^r^ tv otiti/*'^ 

AGGCACTTGG 


OTirPA A A A TIO A 

Gil AAAA i G A 


TO A A O A A OT A 

i GAAG AAG i A 


O A TlOO AOA OO 

LA i GGAGALG 


A TIO A A OO A OO 

A i LrAALL ALL 


1 o nn 
iz UU 


LGGCjTTATLA 


AAA Tl/^T>OT< A TI 

AAA i G 1 G i A i 


OTIOTIA T> A OOT> 

G i G i A i AGG i 


A TIOOTI A OTiTtTi 

Ai LGi AGi i i 


A TiTi A O A O A OO 

Ai iALALiALG 


OOTTIT^A A A AO 

LGi i iAAAAL 


1 0 n 

XZ 0 u 


ACCCAGATAT 


TTTAAAACAT 


GTAAACTGGG 


TAGCGGCCTA 


TACGAATGCT 


TTAGAATGGG 


1320 


AAAACCCTCA 


TTATTCAGGA 


AAAAAAGGTT 


GGCAATATAC 


CTCTTCTGAA 


TACATGAAAG 


1380 


GAATCCAAGG 


GCGCGTAGAT 


GTCAGCGTTT 


GGTAT 






1415 


(2) INFORMATION FOR SEQ ID NO: 158: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 471 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158: 

Val Ala Ala Asn Glu Thr Glu Val Ala Lys Thr Ser Gin Asp Thr Thr 
15 10 15 

Thr Ala Ser Ser Ser Ser Glu Gin Asn Gin Ser Ser Asn Lys Thr Gin 
20 25 30 

Thr Ser Ala Glu Val Gin Thr Asn Ala Ala Ala His Trp Asp Gly Asp 
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35 40 45 

Tyr Tyr Val Lys Asp Asp Gly Ser Lys Ala Gin Ser Glu Trp lie Phe 
50 55 60 

Asp Asn Tyr Tyr Lys Ala Trp Phe Tyr lie Asn Ser Asp Gly Arg Tyr 
65 70 75 80 

Ser Gin Asn Glu Trp His Gly Asn Tyr Tyr Leu Lys Ser Gly Gly Tyr 
85 90 95 

Met Ala Gin Asn Glu Trp lie Tyr Asp Ser Asn Tyr Lys Ser Trp Phe 
100 105 110 

Tyr Leu Lys Ser Asp Gly Ala Tyr Ala His Gin Glu Trp Gin Leu lie 
115 120 125 

Gly Asn Lys Trp Tyr Tyr Phe Lys Lys Trp Gly Tyr Met Ala Lys Ser 
130 135 140 

Gin Trp Gin Gly Ser Tyr Phe Leu Asn Gly Gin Gly Ala Met Met Gin 
145 150 155 160 

Asn Glu Trp Leu Tyr Asp Pro Ala Tyr Ser Ala Tyr Phe Tyr Leu Lys 
165 170 175 

Ser Asp Gly Thr Tyr Ala Asn Gin Glu Trp Gin Lys Val Gly Gly Lys 
180 185 190 

Trp Tyr Tyr Phe Lys Lys Trp Gly Tyr Met Ala Arg Asn Glu Trp Gin 
195 200 205 

Gly Asn Tyr Tyr Leu Thr Gly Ser Gly Ala Met Ala Thr Asp Glu Val 
210 215 220 

lie Met Asp Gly Thr Arg Tyr lie Phe Ala Ala Ser Gly Glu Leu Lys 
225 230 235 240 

Glu Lys Lys Asp Leu Asn Val Gly Trp Val His Arg Asp Gly Lys Arg 
245 250 255 

Tyr Phe Phe Asn Asn Arg Glu Glu Gin Val Gly Thr Glu His Ala Lys 
260 265 270 

Lys Val lie Asp lie Ser Glu His Asn Gly Arg lie Asn Asp Trp Lys 
275 280 285 

Lys Val lie Asp Glu Asn Glu Val Asp Gly Val lie Val Arg Leu Gly 
290 295 300 

Tyr Ser Gly Lys Glu Asp Lys Glu Leu Ala His Asn lie Lys Glu Leu 
305 310 315 320 

Asn Arg Leu Gly lie Pro Tyr Gly Val Tyr Leu Tyr Thr Tyr Ala Glu 
325 330 335 

Asn Glu Thr Asp Ala Glu Ser Asp Ala Lys Gin Thr lie Glu Leu lie 
340 345 350 

Lys Lys Tyr Asn Met Asn Leu Ser Tyr Pro He Tyr Tyr Asp Val Glu 
355 360 365 

Asn Trp Glu Tyr Val Asn Lys Ser Lys Arg Ala Pro Ser Asp Thr Gly 
370 375 380 
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Thr Trp Val Lys lie lie Asn Lys Tyr Met Asp Thr Met Lys Gin Ala 
385 390 395 400 

Gly Tyr Gin Asn Val Tyr Val Tyr Ser Tyr Arg Ser Leu Leu Gin Thr 
405 410 415 

Arg Leu Lys His Pro Asp lie Leu Lys His Val Asn Trp Val Ala Ala 
420 425 430 

Tyr Thr Asn Ala Leu Glu Trp Glu Asn Pro His Tyr Ser Gly Lys Lys 
435 440 445 

Gly Trp Gin Tyr Thr Ser Ser Glu Tyr Met Lys Gly lie Gin Gly Arg 
450 455 460 

Val Asp Val Ser Val Trp Tyr 
465 470 

(2) INFORMATION FOR SEQ ID NO: .159: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1924 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159: 



TACGTCTCAG 


CCTACTTTTG 


TAAGAGCAGA 


AGAATCTCCA 


CAAGTTGTCG 


AAAAATCTTC 


60 


ATTAGAGAAG 


AAATATGAGG 


AAGCAAAAGC 


AAAAGCTGAT 


ACTGCCAAGA 


AAGATTACGA 


120 


AACGGCTAAA 


AAGAAAGCAG 


AAGACGCTCA 


GAAAAAGTAT 


GAAGATGATC 


AGAAGAGAAC 


180 


TGAGGAGAAA 


GCTCGAAAAG 


AAGCAGAAGC 


ATCTCAAAAA 


TTGAATGATG 


TGGCGCTTGT 


240 


TGTTCAAAAT 


GCATATAAAG 


AGTACCGAGA 


AGTTCAAAAT 


CAACGTAGTA 


AATATAAATC 


300 


TGACGCTGAA 


TATCAGAAAA 


AATTAACAGA 


GGTCGACTCT 


AAAATAGAGA 


AGGCTAGGAA 


360 


AGAGCAACAG 


GACTTGCAAA 


ATAAATTTAA 


TGAAGTAAGA 


GCAGTTGTAG 


TTCCTGAACC 


420 


AAATGCGTTG 


GCTGAGACTA 


AGAAAATVAGC 


AGAAGAAGCT 


AAAGCAGAAG 


AAAAAGTAGC 


480 


TAAGAGAAAA 


TATGATTATG 


CAACTCTAAA 


GGTAGCACTA 


GCGAAGAAAG 


AAGTAGAGGC 


540 


TAAGGAACTT 


GAAATTGAAA 


AACTTCAATA 


TGAAATTTCT 


ACTTTGGAAC 


AAGAAGTTGC 


600 


TACTGCTCAA 


CATCAAGTAG 


ATAATTTGAA 


AAAACTTCTT 


GCTGGTGCGG 


ATCCTGATGA 


660 


TGGCACAGAA 


GTTATAGAAG 


CTAAATTAAA 


AAAAGGAGAA 


GCTGAGCTAA 


ACGCTAAACA 


720 


AGCTGAGTTA 


GCAAAAAAAC 


AAACAGAACT 


TGAAAAACTT 


CTTGACAGCC 


TTGATCCTGA 


780 


AGGTAAGACT 


CAGGATGAAT 


TAGATAAAGA 


AGCAGAAGAA 


GCTGAGTTGG 


ATAAAAAAGC 


840 


TGATGAACTT 


CAAAATAAAG 


TTGCTGATTT 


AGAAAAAGAA 


ATTAGTAACC 


TTGAAATATT 


900 


ACTTGGAGGG 


GCTGATNCTG 


AAGATGATAC 


TGCTGCTCTT 


CAAAATAAAT 


TAGCTACTAA 


960 


AAAAGCTGAA 


TTGGAAAAAA 


CTCAAAAAGA 


ATTAGATGCA 


GCTCTTAATG 


AGTTAGGCCC 


1020 



156 



TGATGGAGAT GAAGAAGAAA CTCCAGCGCC GGCTCCTCAA CCAGAGCAAC CAGCTCCTGC 1080 

ACCAAAACCA GAGCAACCAG CTCCAGCTCC AAAACCAGAG CAACCAGCTC CTGCACCAAA 1140 

ACCAGAGCAA CCAGCTCCAG CTCCAAAACC AGAGCAACCA GCTCCAGCTC CAAAACCAGA 1200 

GCAACCAGCT AAGCCGGAGA AACCAGCTGA AGAGCCTACT CAACCAGAAA AACCAGCCAC 1260 

TCCAAAAACA GGCTGGAAAC AAGAAAACGG TATGTGGTAT TTCTACAATA CTGATGGTTC 1320 

AATGGCAATA GGTTGGCTCC AAAACAACGG TTCATGGTAC TACCTAAACG CTAACGGCGC 1380 

TATGGCAACA GGTTGGGTGA AAGATGGAGA TACCTGGTAC TATCTTGAAG CATCAGGTGC 1440 

TATGAAAGCA AGCCAATGGT TCAAAGTATC AGATAAATGG TACTATGTCA ACAGCAATGG 1500 

CGCTATGGCG ACAGGCTGGC TCCAATACAA TGGCTCATGG TACTACCTCA ACGCTAATGG 1560 

TGATATGGCG ACAGGATGGC TCCAATACAA CGGTTCATGG TATTACCTCA ACGCTAATGG 1620 

TGATATGGCG ACAGGATGGG CTAAAGTCAA CGGTTCATGG TACTACCTAA ACGCTAACGG 1680 

TGCTATGGCT ACAGGTTGGG CTAAAGTCAA CGGTTCATGG TACTACCTAA ACGCTAACGG 1740 

TTCAATGGCA ACAGGTTGGG TGAAAGATGG AGATACCTGG TACTATCTTG AAGCATCAGG 1800 

TGCTATGAAA GCAAGCCAAT GGTTCAAAGT ATCAGATAAA TGGTACTATG TCAATGGCTT 1860 

AGGTGCCCTT GCAGTCAACA CAACTGTAGA TGGCTATAAA GTCAATGCCA ATGGTGAATG 1920 

GGTT 1924 
(2) INFORMATION FOR SEQ ID NO: 160: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 641 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160: 

Thr Ser Gin Pro Thr Phe Val Arg Ala Glu Glu Ser Pro Gin Val Val 
15 10 15 

Glu Lys Ser Ser Leu Glu Lys Lys Tyr Glu Glu Ala Lys Ala Lys Ala 
20 25 30 

Asp Thr Ala Lys Lys Asp Tyr Glu Thr Ala Lys Lys Lys Ala Glu Asp 
35 40 45 

Ala Gin Lys Lys Tyr Glu Asp Asp Gin Lys Arg Thr Glu Glu Lys Ala 
50 55 60 

Arg Lys Glu Ala Glu Ala Ser Gin Lys Leu Asn Asp Val Ala Leu Val 
65 70 75 80 

Val Gin Asn Ala Tyr Lys Glu Tyr Arg Glu Val Gin Asn Gin Arg Ser 
85 90 95 
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Lys Tyr Lys Ser Asp Ala Glu Tyr Gin Lys Lys Leu Thr Glu Val Asp 
100 105 110 

Ser Lys lie Glu Lys Ala Arg Lys Glu Gin Gin Asp Leu Gin Asn Lys 
115 120 125 

Phe Asn Glu Val Arg Ala Val Val Val Pro Glu Pro Asn Ala Leu Ala 
130 135 140 

Glu Thr Lys Lys Lys Ala Glu Glu Ala Lys Ala Glu Glu Lys Val Ala 
145 150 155 160 

Lys Arg Lys Tyr Asp Tyr Ala Thr Leu Lys Val Ala Leu Ala Lys Lys 
165 170 175 

Glu Val Glu Ala Lys Glu Leu Glu lie Glu Lys Leu Gin Tyr Glu lie 
180 185 190 

Ser Thr Leu Glu Gin Glu Val Ala Thr Ala Gin His Gin Val Asp Asn 
195 200 205 

Leu Lys Lys Leu Leu Ala Gly Ala Asp Pro Asp Asp Gly Thr Glu Val 
210 215 220 

lie Glu Ala Lys Leu Lys Lys Gly Glu Ala Glu Leu Asn Ala Lys Gin 
225 230 235 240 

Ala Glu Leu Ala Lys Lys Gin Thr Glu Leu Glu Lys Leu Leu Asp Ser 
245 250 255 

Leu Asp Pro Glu Gly Lys Thr Gin Asp Glu Leu Asp Lys Glu Ala Glu 
260 265 270 

Glu Ala Glu Leu Asp Lys Lys Ala Asp Glu Leu Gin Asn Lys Val Ala 
275 280 285 

Asp Leu Glu Lys Glu lie Ser Asn Leu Glu lie Leu Leu Gly Gly Ala 
290 295 300 

Asp Xaa Glu Asp Asp Thr Ala Ala Leu Gin Asn Lys Leu Ala Thr Lys 
305 310 315 320 

Lys Ala Glu Leu Glu Lys Thr Gin Lys Glu Leu Asp Ala Ala Leu Asn 
325 330 335 

Glu Leu Gly Pro Asp Gly Asp Glu Glu Glu Thr Pro Ala Pro Ala Pro 
340 345 350 

Gin Pro Glu Gin Pro Ala Pro Ala Pro Lys Pro Glu Gin Pro Ala Pro 
355 360 365 

Ala Pro Lys Pro Glu Gin Pro Ala Pro Ala Pro Lys Pro Glu Gin Pro 
370 375 380 

Ala Pro Ala Pro Lys Pro Glu Gin Pro Ala Pro Ala Pro Lys Pro Glu 
385 390 395 400 

Gin Pro Ala Lys Pro Glu Lys Pro Ala Glu Glu Pro Thr Gin Pro Glu 
405 410 415 

Lys Pro Ala Thr Pro Lys Thr Gly Trp Lys Gin Glu Asn Gly Met Trp 
420 425 430 

Tyr Phe Tyr Asn Thr Asp Gly Ser Met Ala lie Gly Trp Leu Gin Asn 
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435 440 445 

Asn Gly Ser Trp Tyr Tyr Leu Asn Ala Asn Gly Ala Met Ala Thr Gly 
450 455 460 

Trp Val Lys Asp Gly Asp Thr Trp Tyr Tyr Leu Glu Ala Ser Gly Ala 
465 470 475 480 

Met Lys Ala Ser Gin Trp Phe Lys Val Ser Asp Lys Trp Tyr Tyr Val 
485 490 495 

Asn Ser Asn Gly Ala Met Ala Thr Gly Trp Leu Gin Tyr Asn Gly Ser 
500 505 510 

Trp Tyr Tyr Leu Asn Ala Asn Gly Asp Met Ala Thr Gly Trp Leu Gin 
515 520 525 

Tyr Asn Gly Ser Trp Tyr Tyr Leu Asn Ala Asn Gly Asp Met Ala Thr 
530 535 540 

Gly Trp Ala Lys Val Asn Gly Ser Trp Tyr Tyr Leu Asn Ala Asn Gly 
545 550 555 560 

Ala Met Ala Thr Gly Trp Ala Lys Val Asn Gly Ser Trp Tyr Tyr Leu 
565 570 575 

Asn Ala Asn Gly Ser Met Ala Thr Gly Trp Val Lys Asp Gly Asp Thr 
580 585 590 

Trp Tyr Tyr Leu Glu Ala Ser Gly Ala Met Lys Ala Ser Gin Trp Phe 
595 600 605 

Lys Val Ser Asp Lys Trp Tyr Tyr Val Asn Gly Leu Gly Ala Leu Ala 
610 615 620 

Val Asn Thr Thr Val Asp Gly Tyr Lys Val Asn Ala Asn Gly Glu Trp 
625 630 635 640 

Val 



(2) INFORMATION FOR SEQ ID NO: 161: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 670 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

( D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161: 

TGGACAGGTG AAAGGTCATG CTACATTTGT GAAATCCATG ACAACTGAAA TGTACCAAGA 60 

ACAACAGAAC CATTCTCTCG CCTACAATCA ACGCTTGGNT TCGCAAAATC GCATTGTAGA 120 

TCCTTTTTTG GCGGAGGGAT ATGAGGTCAA TTACCAAGTG TCTGACGACC CTGATGCAGT 180 

CTATGGTTAC TTGTCTATTC CAAGTTTGGA AATCATGGAG CCGGTTTATT TGGGAGCAGA 240 

TTATCATCAT TTAGGGATGG GCTTGGCTCA TGTGGATGGT ACACCGCTGC CTCTGGATGG 300 

TACAGGGATT CGCTCAGTGA TTGCTGGGCA CCGTGCAGAG CCAAGCCATG TCTTTTTCCG 360 
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CCATTTGGAT CAGCTAAAAG TTGGAGATGC TCTTTATTAT GATAATGGCC AGGAAATTGT 420 

AGAATATCAG ATGATGGACA CAGAGATTAT TTTACCGTCG GAATGGGAAA AATTAGAATC 480 

GGTTAGCTCT AAAAATATCA TGACCTTGAT AACCTGCGAT CCGATTCCTA CCTTTAATAA 540 

ACGCTTATTA GTGAATTTTG AACGAGTCGC TGTTTATCAA AAATCAGATC CACAAACAGC 600 

TGCAGTTGCG AGGGTTGCTT TTACGAAAGA AGGACAATCT GTATCGCGTG TTGCAACCTC 660 

TCAATGGTTG 670 
(2) INFORMATION FOR SEQ ID NO: 162: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 223 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162: 

Gly Gin Val Lys Gly His Ala Thr Phe Val Lys Ser Met Thr Thr Glu 
15 10 15 

Met Tyr Gin Glu Gin Gin Asn His Ser Leu Ala Tyr Asn Gin Arg Leu 
20 25 30 

Xaa Ser Gin Asn Arg lie Val Asp Pro Phe Leu Ala Glu Gly Tyr Glu 
35 40 45 

Val Asn Tyr Gin Val Ser Asp Asp Pro Asp Ala Val Tyr Gly Tyr Leu 
50 55 60 

Ser lie Pro Ser Leu Glu lie Met Glu Pro Val Tyr Leu Gly Ala Asp 
65 70 75 80 

Tyr His His Leu Gly Met Gly Leu Ala His Val Asp Gly Thr Pro Leu 
85 90 95 

Pro Leu Asp Gly Thr Gly lie Arg Ser Val lie Ala Gly His Arg Ala 
100 105 110 

Glu Pro Ser His Val Phe Phe Arg His Leu Asp Gin Leu Lys Val Gly 
115 120 125 

Asp Ala Leu Tyr Tyr Asp Asn Gly Gin Glu lie Val Glu Tyr Gin Met 
130 135 140 

Met Asp Thr Glu lie lie Leu Pro Ser Glu Trp Glu Lys Leu Glu Ser 
145 150 155 160 

Val Ser Ser Lys Asn lie Met Thr Leu lie Thr Cys Asp Pro lie Pro 
165 170 175 

Thr Phe Asn Lys Arg Leu Leu Val Asn Phe Glu Arg Val Ala Val Tyr 
180 185 190 

Gin Lys Ser Asp Pro Gin Thr Ala Ala Val Ala Arg Val Ala Phe Thr 
195 200 205 



160 



Lys Glu Gly Gin Ser Val Ser Arg Val Ala Thr Ser Gin Trp Leu 
210 215 220 

(2) INFORMATION FOR SEQ ID NO: 163: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 784 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163: 















fin 




A A /^r^ A A OTITT^ 




a PTT* A a p A a 




a a T^'P'P a a t'PP 


JL ^ U 


TATGTTGGAT 


CAGATTGATC 


AGTTGATGGT 


AGCTATTCGT 


AGCCAGGAAG 


AAACGACCCG 


180 


TCAGTACCAA 


CTTCAAGCCC 


TTTCGAGCCA 


GATTAATCCA 


CATTTCCTCT 


ATAACACTTT 


240 


GGACACCATC 


ATCTGGATGG 


CTGAATTTCA 


TGATAGTCAG 


CGAGTGGTGC 


AGGTGACCAA 


300 


GTCCTTGGCA 


ACCTATTTCC 


GCTTGGCGCT 


CAATCAAGGC 


AAGGACTTGA 


TTTGTCTCTC 


360 


TGACGAAATC 


AATCATGTCC 


GCCAGTATCT 


CTTTATCCAG 


AAACAACGCT 


ATGGAGATAA 


420 


GCTGGAATAC 


GAAATTAATG 


AAAATGTTGC 


CTTTGATAAT 


TTAGTCTTAC 


CCAAGCTGGT 


480 


CCTACAACCC 


CTTGTAGAAA 


ATGCTCTTTA 


CCATGGCATT 


AAGGAAAAGG 


AAGGTCAGGG 


540 


CCATATTAAA 


CTTTCTGTCC 


AGAAACAGGA 


TTCGGGATTG 


GTCATCCGTA 


TTGAGGATGA 


600 


TGGCGTTGGC 


TTCCAAGATG 


CTGGTGATAG 


TAGTCAAAGT 


CAACTCAAAC 


GTGGGGGAGT 


660 


TGGTCTTCAA 


AATGTCGATC 


AACGGCTCAA 


ACTTCATTTT 


GGAGCCAATT 


ACCATATGAA 


720 


GATTGATTCT 


AGACCCCAAA 


AAGGGACGAA 


AGTTGAAATA 


TATATAAATA 


GAATAGAAAC 


780 


TAGC 












784 



(2) INFORMATION FOR SEQ ID NO: 164: 

(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 261 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
{ D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164: 

lie Ala Pro Leu Lys Asp Leu Arg Glu Thr Met Leu Glu lie Ala Ser 
15 10 15 

Gly Ala Gin Asn Leu Arg Ala Lys Glu Val Gly Ala Tyr Glu Leu Arg 
20 25 30 

Glu Val Thr Arg Gin Phe Asn Ala Met Leu Asp Gin lie Asp Gin Leu 
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35 40 45 

Met Val Ala He Arg Ser Gin Glu Glu Thr Thr Arg Gin Tyr Gin Leu 
50 55 60 

Gin Ala Leu Ser Ser Gin He Asn Pro His Phe Leu Tyr Asn Thr Leu 
65 70 75 80 

Asp Thr He He Trp Met Ala Glu Phe His Asp Ser Gin Arg Val Val 
85 90 95 

Gin Val Thr Lys Ser Leu Ala Thr Tyr Phe Arg Leu Ala Leu Asn Gin 
100 105 110 

Gly Lys Asp Leu He Cys Leu Ser Asp Glu He Asn His Val Arg Gin 
115 120 125 

Tyr Leu Phe He Gin Lys Gin Arg Tyr Gly Asp Lys Leu Glu Tyr Glu 
130 135 140 

He Asn Glu Asn Val Ala Phe Asp Asn Leu Val Leu Pro Lys Leu Val 
145 150 155 160 

Leu Gin Pro Leu Val Glu Asn Ala Leu Tyr His Gly He Lys Glu Lys 
165 170 175 

Glu Gly Gin Gly His He Lys Leu Ser Val Gin Lys Gin Asp Ser Gly 
180 185 190 

Leu Val He Arg He Glu Asp Asp Gly Val Gly Phe Gin Asp Ala Gly 
195 200 205 

Asp Ser Ser Gin Ser Gin Leu Lys Arg Gly Gly Val Gly Leu Gin Asn 
210 215 220 

Val Asp Gin Arg Leu Lys Leu His Phe Gly Ala Asn Tyr His Met Lys 
225 230 235 240 

He Asp Ser Arg Pro Gin Lys Gly Thr Lys Val Glu He Tyr He Asn 
245 250 255 

Arg He Glu Thr Ser 
260 

(2) INFORMATION FOR SEQ ID NO: 165: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 325 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165: 

TAGGTCATAT GGGACTTTTT TTCTACAACA AAATAGGCTC CATAATATCT ATAAGGGATT 60 

TACCCACTAC AAATATTATA GAGCCGAAAA TTCACATCTA ATATATGCAG ACTACTTTGA 120 

AATGAAATTA AAAAAATTAT TAAAGGATGA CACAAAAGTT TTTGAAAAAT CTACATTCAA 180 

ATTTGTAGAA GGATATAAAA TATACCTGAC AGAATCTAAA GAATCTGGAA TTAAACAAAT 240 



162 



GGACAATGTC ATAAAATATT TTGAGTTTAT TGAATCTAAA AGTATTGCTT TATATTTTCA 300 
AAAACGATTA AATGAGCTGA TAGAT 325 
(2) INFORMATION FOR SEQ ID NO: 166: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166: 

Arg Ser Tyr Gly Thr Phe Phe Leu Gin Gin Asn Arg Leu His Asn lie 
1 5 10 15 

Tyr Lys Gly Phe Thr His Tyr Lys Tyr Tyr Arg Ala Glu Asn Ser His 
20 25 30 

Leu lie Tyr Ala Asp Tyr Phe Glu Met Lys Leu Lys Lys Leu Leu Lys 
35 40 45 

Asp Asp Thr Lys Val Phe Glu Lys Ser Thr Phe Lys Phe Val Glu Gly 
50 55 60 

Tyr Lys lie Tyr Leu Thr Glu Ser Lys Glu Ser Gly lie Lys Gin Met 
65 70 75 80 

Asp Asn Val lie Lys Tyr Phe Glu Phe lie Glu Ser Lys Ser lie Ala 
85 90 95 

Leu Tyr Phe Gin Lys Arg Leu Asn Glu Leu lie Asp 
100 105 

(2) INFORMATION FOR SEQ ID NO: 167: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 238 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167: 

CAACGTTGAG AATTATTTGC GAATGTGTTT GGATAGCATT CAGAATCAGA CGTATCAAAA 60 

TTTTGAGTGT TTATTAATCA ATGATGGCTC TCCAGATCAT TCATCCAAAA TATGTGAAGA 120 

ATTTGTAGAG AAAGATTCTC GTTTCAAATA TTTTGAGAAA GCAAACGGCG GTCTTTCATC 180 

AGCTCGTAAC CTAGGTATTG AATGTTCGGG GGGGGGCGTA CATTACTTTT GTAGACTC 238 
(2) INFORMATION FOR SEQ ID NO: 168: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 79 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168: 

Asn Val Glu Asn Tyr Leu Arg Met Cys Leu Asp Ser lie Gin Asn Gin 
15 10 15 

Thr Tyr Gin Asn Phe Glu Cys Leu Leu lie Asn Asp Gly Ser Pro Asp 
20 25 30 

His Ser Ser Lys lie Cys Glu Glu Phe Val Glu Lys Asp Ser Arg Phe 
35 40 45 

Lys Tyr Phe Glu Lys Ala Asn Gly Gly Leu Ser Ser Ala Arg Asn Leu 
50 55 60 

Gly lie Glu Cys Ser Gly Gly Gly Val His Tyr Phe Cys Arg Leu 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 169: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 742 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169: 

CTACTATCAA TCAAGTTCTT CAGCCATTGA GGCCACCATT GAGGGCAACA GCCAAACGAC 60 

CATCAGCCAG ACTAGCCACT TTATTCAGTC TTATATCAAA AAACTAGAAA CCACCTCGAC 120 

TGGTTTGACC CAGCAGACGG ATGTTCTGGC CTATGCTGAG AATCCCAGTC AAGACAAGGT 180 

CGAGGGAATC CGAGATTTGT TTTTGACCAT CTTGAAGTCA GATAAGGACT TGAAAACTGT 240 

TGTGCTGGTG ACCAAATCTG GTCAGGTCAT TTCTACAGAT GACAGTGTGC AGATGAAAAC 300 

TTCCTCTGAT ATGATGGCTG AGGATTGGTA CCAAAAGGCC ATTCATCAGG GAGCTATGCC 360 

TGTTTTGACT CCAGCTCGTA AATCAGATAG TCAGTGGGTC ATTTCTGTCA CTCAAGAACT 420 

TGTTGATGCA AAGGGAGCCA ATCTTGGTGT GCTTCGTTTG GATATTTCTT ATGAAACTCT 480 

GGAAGCCTAT CTCAATCAAC TCCAGTTGGG GCAGCAGGGC TTTGCCTTCA TTATCAATGA 540 

AAACCATGAA TTTGTCTACC ATCCTCAACA CACAGTTTAT AGTTCGTCTA GCAAAATGGA 600 

GGCTATGAAA CCCTACATCG ATACAGGTCA GGGTTATACT CCTGGTCACA AATCCTACGT 660 

CAGTCAAGAG AAGATTGCAG GAACTGATTG GACGGTGCTT GGCGTGTCAT CATTGGAAAA 720 

GTTAGACCAG GTTCGGAGTC AG 742 

(2) INFORMATION FOR SEQ ID NO: 170: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 247 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170: 

Tyr Tyr Gin Ser Ser Ser Ser Ala lie Glu Ala Thr lie Glu Gly Asn 
15 10 15 

Ser Gin Thr Thr lie Ser Gin Thr Ser His Phe lie Gin Ser Tyr lie 
20 25 30 

Lys Lys Leu Glu Thr Thr Ser Thr Gly Leu Thr Gin Gin Thr Asp Val 
35 40 45 

Leu Ala Tyr Ala Glu Asn Pro Ser Gin Asp Lys Val Glu Gly lie Arg 
50 55 60 

Asp Leu Phe Leu Thr lie Leu Lys Ser Asp Lys Asp Leu Lys Thr Val 
65 70 75 80 

Val Leu Val Thr Lys Ser Gly Gin Val lie Ser Thr Asp Asp Ser Val 
85 90 95 

Gin Met Lys Thr Ser Ser Asp Met Met Ala Glu Asp Trp Tyr Gin Lys 
100 105 110 

Ala lie His Gin Gly Ala Met Pro Val Leu Thr Pro Ala Arg Lys Ser 
115 120 125 

Asp Ser Gin Trp Val lie Ser Val Thr Gin Glu Leu Val Asp Ala Lys 
130 135 140 

Gly Ala Asn Leu Gly Val Leu Arg Leu Asp lie Ser Tyr Glu Thr Leu 
145 150 155 160 

Glu Ala Tyr Leu Asn Gin Leu Gin Leu Gly Gin Gin Gly Phe Ala Phe 
165 170 175 

lie lie Asn Glu Asn His Glu Phe Val Tyr His Pro Gin His Thr Val 
180 185 190 

Tyr Ser Ser Ser Ser Lys Met Glu Ala Met Lys Pro Tyr lie Asp Thr 
195 200 205 

Gly Gin Gly Tyr Thr Pro Gly His Lys Ser Tyr Val Ser Gin Glu Lys 
210 215 220 

lie Ala Gly Thr Asp Trp Thr Val Leu Gly Val Ser Ser Leu Glu Lys 
225 230 235 240 

Leu Asp Gin Val Arg Ser Gin 
245 

INFORMATION FOR SEQ ID NO: 171: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1282 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171: 



GACAAAAACA 


TTAAAACGTC 


CTGAGGTTTT 


ATCACCTGCA 


GGGACTTTAG 


AGAAGCTAAA 


60 


GGTAGCTGTT 


CAGTATGGAG 


CAGATGCTGT 


CTTTATCGGT 


GGTCAGGCCT 


ATGGTCTTCG 


120 


TAGCCGTGCG 


GGAAACTTTA 


CTTTCGAACA 


GATGGAAGAA 


GGCGTGCAGT 


TTGCGGCCAA 


180 


GTATGGTGCC 


AAGGTCTATG 


TAGCGGCTAA 


TATGGTTATG 


CACGAAGGAA 


ATGAAGCTGG 


240 


TGCTGGTGAG 


TGGTTCCGTA 


AACTGCGTGA 


TATCGGGATT 


GCAGCAGTTA 


TCGTATCTGA 


300 


CCCAGCCTTG 


ATTATGATTG 


CAGTGACTGA 


AGCACCAGGC 


CTTGAAATCC 


ACCTTTCTAC 


360 


CCAAGCCAGT 


GCCACTAACT 


ATGAAACCCT 


TGAGTTCTGG 


AAAGAGCTAG 


GCTTGACTCG 


420 


TGTCGTTTTA 


GCGCGTGAGG 


TTTCAATGGA 


AGAATTAGCT 


GAGATCCGCA 


AACGTACAGA 


480 


TGTTGAAATT 


GAAGCCTTTG 


TCCATGGAGC 


TATGTGTATT 


TCATACTCTG 


GACGTTGTAC 


540 


TCTTTCAAAC 


CACATGAGTA 


TGCGTGATGC 


CAACCGTGGT 


GGATGTTCTC 


AGTCATGCCG 


600 


TTGGAAATAC 


GACCTTTACG 


ATATGCCATT 


TGGGAAAGAA 


CGTAAGAGTT 


TGCAGGGTGA 


660 


GATTCCAGAA 


GAATTTTCAA 


TGTCAGCCGT 


TGACATGTCT 


ATGATTGACC 


ANATTCCAGA 


720 


TATGATTGAA 


AATGGTGTGG 


ACAGTCTAAA 


AATCGAAGGA 


CGTATGNAGT 


CTATTCACTA 


780 


NGTATCAACA 


GTAACCAACT 


GCTACAAGGC 


GGCTGTGGAT 


GCCTATCTTG 


AAAGTCCTGA 


840 


AAAGTTTGAA 


GCTATCAAAC 


AAGACTTGGT 


GGACGAGATG 


TGGAAGGTTG 


CCCAACGTGA 


900 


ACTGGCTACA 


GGATTTTACT 


ATGGTACACC 


ATCTGAAAAT 


GAGCAGTTGT 


TTGGTGCTCG 


960 


TCGTAAAATC 


CCTGAGTACA 


AGTTTGTCGC 


TGAAGTGGTT 


TCTTATGATG 


ATGCGGCACA 


1020 


AACAGCAACT 


ATTCGTCAAC 


GAAACGTCAT 


TAACGAAGGG 


GACCAAGTTG 


AGTTTTATGG 


1080 


TCCAGGTTTC 


CGTCATTTTG 


AAACCTATAT 


TGAAGATTTG 


CATGATGCTA 


AAGGCAATAA 


1140 


AATCGACCGC 


GCTCCAAATC 


CAATGGAACT 


ATTGACTATT 


AAAGTCCCAC 


AACCTGTTCA 


1200 


ATCAGGAGAC 


ATGGTTCGAG 


CTCTTAAAGA 


GGGGCTTATC 


AATCTTTATA 


AGGAAGATGG 


1260 


AACCAGCGTC 


ACAGTTCGTG 


CT 








1282 


(2) INFORMATION FOR SEQ ID NO: 172: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 427 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172: 



Thr Lys Thr Leu Lys Arg Pro Glu Val Leu Ser Pro Ala Gly Thr Leu 
15 10 15 
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Glu Lys Leu Lys Val Ala Val Gin Tyr Gly Ala Asp Ala Val Phe lie 
20 25 30 

Gly Gly Gin Ala Tyr Gly Leu Arg Ser Arg Ala Gly Asn Phe Thr Phe 
35 40 45 

Glu Gin Met Glu Glu Gly Val Gin Phe Ala Ala Lys Tyr Gly Ala Lys 
50 55 60 

Val Tyr Val Ala Ala Asn Met Val Met His Glu Gly Asn Glu Ala Gly 
65 70 75 80 

Ala Gly Glu Trp Phe Arg Lys Leu Arg Asp lie Gly lie Ala Ala Val 
85 90 95 

lie Val Ser Asp Pro Ala Leu lie Met lie Ala Val Thr Glu Ala Pro 
100 105 110 

Gly Leu Glu lie His Leu Ser Thr Gin Ala Ser Ala Thr Asn Tyr Glu 
115 120 125 

Thr Leu Glu Phe Trp Lys Glu Leu Gly Leu Thr Arg Val Val Leu Ala 
130 135 140 

Arg Glu Val Ser Met Glu Glu Leu Ala Glu lie Arg Lys Arg Thr Asp 
145 150 155 160 

Val Glu lie Glu Ala Phe Val His Gly Ala Met Cys lie Ser Tyr Ser 
165 170 175 

Gly Arg Cys Thr Leu Ser Asn His Met Ser Met Arg Asp Ala Asn Arg 
180 185 190 

Gly Gly Cys Ser Gin Ser Cys Arg Trp Lys Tyr Asp Leu Tyr Asp Met 
195 200 205 

Pro Phe Gly Lys Glu Arg Lys Ser Leu Gin Gly Glu lie Pro Glu Glu 
210 215 220 

Phe Ser Met Ser Ala Val Asp Met Ser Met lie Asp Xaa lie Pro Asp 
225 230 235 240 

Met lie Glu Asn Gly Val Asp Ser Leu Lys lie Glu Gly Arg Met Xaa 
245 250 255 

Ser lie His Xaa Val Ser Thr Val Thr Asn Cys Tyr Lys Ala Ala Val 
260 265 270 

Asp Ala Tyr Leu Glu Ser Pro Glu Lys Phe Glu Ala lie Lys Gin Asp 
275 280 285 

Leu Val Asp Glu Met Trp Lys Val Ala Gin Arg Glu Leu Ala Thr Gly 
290 295 300 

Phe Tyr Tyr Gly Thr Pro Ser Glu Asn Glu Gin Leu Phe Gly Ala Arg 
305 310 315 320 

Arg Lys lie Pro Glu Tyr Lys Phe Val Ala Glu Val Val Ser Tyr Asp 
325 330 335 

Asp Ala Ala Gin Thr Ala Thr lie Arg Gin Arg Asn Val lie Asn Glu 
340 345 350 
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Gly Asp Gin Val Glu Phe Tyr Gly Pro Gly Phe Arg His Phe Glu Thr 
355 360 365 

Tyr lie Glu Asp Leu His Asp Ala Lys Gly Asn Lys lie Asp Arg Ala 
370 375 380 

Pro Asn Pro Met Glu Leu Leu Thr lie Lys Val Pro Gin Pro Val Gin 
385 390 395 400 

Ser Gly Asp Met Val Arg Ala Leu Lys Glu Gly Leu lie Asn Leu Tyr 
405 410 415 

Lys Glu Asp Gly Thr Ser Val Thr Val Arg Ala 
420 425 

(2) INFORMATION FOR SEQ ID NO: 173: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173: 

TTCTCAGGAG ACCTTTAAAA ATATCACCAA TAGCTTCTCC ATGCAAATCA ATCGTCGCGT 60 

CAACCAAGGA ACGCCTCGTG GTGCTGGGAA TATCAAGGGT GAAGACATCA AAAAAATCAC 120 

CGAAAACAAG GCCATTGAGT CTTATGTCAA ACGTATCAAC GCTATCGGAG ATTTGACTGG 180 

ATATGACCTG ATTGAAACGC CAGAAACCAA GAAGAATCTC ACTGCTGATC GTGCCAAGCG 240 

TTTTGGAAGT AGCTTGATGA TTACAGGTGT CAATGACTCC TCTAAAGAAG ACAAGTTTGT 300 

CTCTGGTTCT TATAAACTAG TCGAAGGAGA GCACTTAACC AACGACGACA AGGATAAAAT 360 

CCTCTTGCAC AAGGACTTGG CAGCCAAACA CGGCTGGAAA GTAGGGGACA AGGTTAAACT 420 

GGACTCTAAT ATCTACGATG CAGATAATGA AAAAGGAGCC AAGGAAACAG TTGAAGTGAC 480 

AATCAAGGGA CTCTTTGATG GTCATAATAA GTCAGCAGTA ACCTACTCAC AAGAACTTTA 540 

CGAAAACACA GCTATTACAG ACATTCACAC TGCTGCAAAA CTTTATGGAT ACACAGAAGA 600 

CACAGCCATT TATGGGGACG CAACCTTCTT TGTAACAGCA GACAAGAACT TGGATGATGT 660 

TATGAAAGAG TTGAATGGCA TCAGTGGTAT CAACTGGAAG AGCTACACAC TCGTCAAGAG 720 

CTCCTCTAAC TACCCAGCTC TTGAGCAATC TATCTCTGGT ATGTACAAGA TGGCCAAC 778 
(2) INFORMATION FOR SEQ ID NO: 174: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 259 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:174: 

Ser Gin Glu Thr Phe Lys Asn lie Thr Asn Ser Phe Ser Met Gin lie 
1 5 10 15 

Asn Arg Arg Val Asn Gin Gly Thr Pro Arg Gly Ala Gly Asn lie Lys 
20 25 30 

Gly Glu Asp lie Lys Lys lie Thr Glu Asn Lys Ala lie Glu Ser Tyr 
35 40 45 

Val Lys Arg lie Asn Ala lie Gly Asp Leu Thr Gly Tyr Asp Leu lie 
50 55 60 

Glu Thr Pro Glu Thr Lys Lys Asn Leu Thr Ala Asp Arg Ala Lys Arg 
65 70 75 80 

Phe Gly Ser Ser Leu Met lie Thr Gly Val Asn Asp Ser Ser Lys Glu 
85 90 95 

Asp Lys Phe Val Ser Gly Ser Tyr Lys Leu Val Glu Gly Glu His Leu 
100 105 110 

Thr Asn Asp Asp Lys Asp Lys lie Leu Leu His Lys Asp Leu Ala Ala 
115 120 125 

Lys His Gly Trp Lys Val Gly Asp Lys Val Lys Leu Asp Ser Asn lie 
130 135 140 

Tyr Asp Ala Asp Asn Glu Lys Gly Ala Lys Glu Thr Val Glu Val Thr 
145 150 155 160 

lie Lys Gly Leu Phe Asp Gly His Asn Lys Ser Ala Val Thr Tyr Ser 
165 170 175 

Gin Glu Leu Tyr Glu Asn Thr Ala lie Thr Asp lie His Thr Ala Ala 
180 185 190 

Lys Leu Tyr Gly Tyr Thr Glu Asp Thr Ala lie Tyr Gly Asp Ala Thr 
195 200 205 

Phe Phe Val Thr Ala Asp Lys Asn Leu Asp Asp Val Met Lys Glu Leu 
210 215 220 

Asn Gly lie Ser Gly lie Asn Trp Lys Ser Tyr Thr Leu Val Lys Ser 
225 230 235 240 

Ser Ser Asn Tyr Pro Ala Leu Glu Gin Ser lie Ser Gly Met Tyr Lys 
245 250 255 

Met Ala Asn 



INFORMATION FOR SEQ ID NO: 175: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 694 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175: 
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AGTAAATGCG CAATCAAATT CATTAATATT AATAGATGAA CCTGAAATCT CACTTCATCC 60 

GAGTGCAATC TATAAATTTA AAGAGTTTTT ACTTCAAGAG TGTTTAAATA AAAAACATCA 120 

AATTATTATC ACTACACATT CTACACAACT TATAAAAGAT TTTCCTAGAG AAGCCGTGAA 180 

ACTTTTAGTG AAAAACGGAG AAAAGGTAGA TGTTATTGAA AATATTGATT ATCAGGATGC 240 

ATTTTTTGAA TTAGGTGATG TGTATCATTC TAGGAAGATG ATTTATGTTG AAGATAGACT' 300 

AGCTAAATAT ATTCTAGAGT TTGTTATCAC TCATTCAGGT AGTGAGAATC TTAAACAGAA 360 

TTTAGTAGTG AGATATATTC CTGGTGGAGC AAATCAAATA ATTTGTAATA ATATTTTAAA 420 

CTCATCGTAT TTAGATTCCG ATAACCATTA TTTTTGGCTT GATGGAGATC AAAACACTAA 480 

TGTTAGTGAA TCAAATAATT TAATGAACTA TCTTGAAAAT GGTGTTGTTA TATCAGATAA 540 

AATTCCTGAA TCAGATAATA AAAATCTTGA TGATATTATA AAATTGATAA NGGGATGTCC 600 

AATTAAATTT AATGTTTCAG GTAATAAAGG GCAAAAAAAT AATATTGAAT TAATTGCGAA 660 

ACAAAGAAGC TTTATAGATT ATTGGGCTAA ATAC 694 
(2) INFORMATION FOR SEQ ID NO: 176: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 231 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176: 

Val Asn Ala Gin Ser Asn Ser Leu lie Leu lie Asp Glu Pro Glu lie 
15 10 15 

Ser Leu His Pro Ser Ala lie Tyr Lys Phe Lys Glu Phe Leu Leu Gin 
20 25 30 

Glu Cys Leu Asn Lys Lys His Gin lie lie lie Thr Thr His Ser Thr 
35 40 45 

Gin Leu lie Lys Asp Phe Pro Arg Glu Ala Val Lys Leu Leu Val Lys 
50 55 60 

Asn Gly Glu Lys Val Asp Val lie Glu Asn lie Asp Tyr Gin Asp Ala 
65 70 75 80 

Phe Phe Glu Leu Gly Asp Val Tyr His Ser Arg Lys Met lie Tyr Val 
85 90 95 

Glu Asp Arg Leu Ala Lys Tyr lie Leu Glu Phe Val lie Thr His Ser 
100 105 110 

Gly Ser Glu Asn Leu Lys Gin Asn Leu Val Val Arg Tyr lie Pro Gly 
115 120 125 

Gly Ala Asn Gin lie lie Cys Asn Asn lie Leu Asn Ser Ser Tyr Leu 
130 135 140 
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Asp Ser Asp Asn His Tyr Phe Trp Leu Asp Gly Asp Gin Asn Thr Asn 
145 150 155 160 

Val Ser Glu Ser Asn Asn Leu Met Asn Tyr Leu Glu Asn Gly Val Val 
165 170 175 

lie Ser Asp Lys lie Pro Glu Ser Asp Asn Lys Asn Leu Asp Asp lie 
180 185 190 

lie Lys Leu lie Xaa Gly Cys Pro lie Lys Phe Asn Val Ser Gly Asn 
195 200 205 

Lys Gly Gin Lys Asn Asn lie Glu Leu lie Ala Lys Gin Arg Ser Phe 
210 215 220 

lie Asp Tyr Trp Ala Lys Tyr 
225 230 

(2) INFORMATION FOR SEQ ID NO: 177: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 550 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177: 



TTACCGCGTT 


CATCAAGATG 


TCAAACAAGT 


CATGACCTAT 


CAACCCATGG 


TGCGAGAAAT 


60 


ATTGAGTGAA 


CAAGACACCC 


CAGCAAACGA 


AGAGCTTGTG 


CTTGCTATGA 


TTTATACTGA 


120 


AACAAAAGGA 


AAAGAAGGCG 


ATGTTATGCA 


GTCTAGTGAG 


TCTGCAAGTG 


GTTCCACCAA 


180 


CACCATCAAT 


GATAATGCCT 


CTAGCATTCG 


GCAAGGCATT 


CAAACTCTGA 


CAGGCAATCT 


240 


CTATCTGGCG 


CAGAAGAAGG 


GGGTAGATAT 


CTGGACAGCT 


GTTCAAGCCT 


ATAATTTTGG 


300 


ACCTGCCTAT 


ATCGATTTTA 


TCGCCCAAAA 


TGGCAAGGAA 


AATACCCTGG 


CTCTAGCCAA 


360 


ACAGTACTCT 


CGTGAGACTG 


TTGCCCCCTT 


GCTTGGTAAT 


AGGACTGGAA 


AGACTTATAG 


420 


TTATATTCAC 


CCCATTTCCA 


TTTTTCACGG 


TGCTGAACTC 


TATGTAAATG 


GAGGAAACTA 


480 


TTATTATTCT 


AGACAGGTAC 


GACTTAACCT 


TTACATCATC 


AAATGTTTCA 


CTCTCTTTTC 


540 


AACATCTGGC 
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(2) INFORMATION FOR SEQ ID NO: 178: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 183 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178: 
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Tyr Arg Val His Gin Asp Val Lys Gin Val Met Thr Tyr Gin Pro Met 
15 10 15 

Val Arg Glu lie Leu Ser Glu Gin Asp Thr Pro Ala Asn Glu Glu Leu 
20 25 30 

Val Leu Ala Met lie Tyr Thr Glu Thr Lys Gly Lys Glu Gly Asp Val 
35 40 45 

Met Gin Ser Ser Glu Ser Ala Ser Gly Ser Thr Asn Thr lie Asn Asp 
50 55 60 

Asn Ala Ser Ser lie Arg Gin Gly lie Gin Thr Leu Thr Gly Asn Leu 
65 70 75 80 

Tyr Leu Ala Gin Lys Lys Gly Val Asp lie Trp Thr Ala Val Gin Ala 
85 90 95 

Tyr Asn Phe Gly Pro Ala Tyr lie Asp Phe lie Ala Gin Asn Gly Lys 
100 105 110 

Glu Asn Thr Leu Ala Leu Ala Lys Gin Tyr Ser Arg Glu Thr Val Ala 
115 120 125 

Pro Leu Leu Gly Asn Arg Thr Gly Lys Thr Tyr Ser Tyr lie His Pro 
130 135 140 

lie Ser lie Phe His Gly Ala Glu Leu Tyr Val Asn Gly Gly Asn Tyr 
145 150 155 160 

Tyr Tyr Ser Arg Gin Val Arg Leu Asn Leu Tyr lie lie Lys Cys Phe 
165 170 175 

Thr Leu Phe Ser Thr Ser Gly 
180 

(2) INFORMATION FOR SEQ ID NO: 179: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 334 base pairs 

(B) TYPE: nucleic acid 

(C ) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179: 

GTGGATGGGC TTTAACTATC TTCGTATTCG CCGTGCGGCT AAAATTGTGG ACAATGAGGA 60 

GTTTGAAGCC TTGATTCGTA CGGGTCAATT GATTGATTTG CGCGACCCAG CAGAATTCCA 120 

CAGAAAACAT ATCCTTGGTG CACGCAATAT TCCTTCAAGT CAGTTGAAAA CTAGTCTTGC 180 

AGCCCTTCGT AAAGATAAAC CTGTCCTTCT CTACGAAAAC CAACGTGCGC AACGAGTTAC 240 

AAATGCAGCT CTTTACTTGA AAAAACAAGG TTTTTCTGAG ATTTATATCC TTTCTTATGG 300 

CTTGGATTCT TGGAAAGGGA AAGTGAAGAC TAGC 334 

(2) INFORMATION FOR SEQ ID NO: 180: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180: 

Trp Met Gly Phe Asn Tyr Leu Arg lie Arg Arg Ala Ala Lys lie Val 
15 10 15 

Asp Asn Glu Glu Phe Glu Ala Leu lie Arg Thr Gly Gin Leu lie Asp 
20 25 30 

Leu Arg Asp Pro Ala Glu Phe His Arg Lys His lie Leu Gly Ala Arg 
35 40 45 

Asn lie Pro Ser Ser Gin Leu Lys Thr Ser Leu Ala Ala Leu Arg Lys 
50 55 60 

Asp Lys Pro Val Leu Leu Tyr Glu Asn Gin Arg Ala Gin Arg Val Thr 
65 70 75 80 

Asn Ala Ala Leu Tyr Leu Lys Lys Gin Gly Phe Ser Glu lie Tyr lie 
85 90 95 

Leu Ser Tyr Gly Leu Asp Ser Trp Lys Gly Lys Val Lys Thr Ser 
100 105 110 

(2) INFORMATION FOR SEQ ID NO: 181: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1342 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181: 

ACTAAACCAG CATCGTTCGC AGGAAAATAA GGACAATAAT CGTGTCTCTT ATGTGGATGG 60 

CAGCCAGTCA AGTCAGAAAA GTGAAAACTT GACACCAGAC CAGGTTAGCC AGAAAGAAGG 120 

AATTCAGGCT GAGCAAATTG TAATCAAAAT TACAGATCAG GGCTATGTAA CGTCACACGG 180 

TGACCACTAT CATTACTATA ATGGGAAAGT TCCTTATGAT GCCCTCTTTA GTGAAGAACT 240 

CTTGATGAAG GATCCAAACT ATCAACTTAA AGACGCTGAT ATTGTCAATG AAGTCAAGGG 300 

TGGTTATATC ATCAAGGTCG ATGGAAAATA TTATGTCTAC CTGAAAGATG CAGCTCATGC 360 

TGATAATGTT CGAACTAAAG ATGAAATCAA TCGTCAAAAA CAAGAACATG TCAAAGATAA 420 

TGAGAAGGTT AACTCTAATG TTGCTGTAGC AAGGTCTCAG GGACGATATA CGACAAATGA 480 

TGGTTATGTC TTTAATCCAG CTGATATTAT CGAAGATACG GGTAATGCTT ATATCGTTCC 540 

TCATGGAGGT CACTATCACT ACATTCCCAA AAGCGATTTA TCTGCTAGTG AATTAGCAGC 600 

AGCTAAAGCA CATCTGGCTG GAAAAAATAT GCAACCGAGT CAGTTAAGCT ATTCTTCAAC 660 
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AGCTAGTGAC AATAACACGC AATCTGTAGC AAAAGGATCA ACTAGCAAGC CAGCAAATAA 72 0 

ATCTGAAAAT CTCCAGAGTC TTTTGAAGGA ACTCTATGAT TCACCTAGCG CCCAACGTTA 780 

CAGTGAATCA GATGGCCTGG TCTTTGACCC TGCTAAGATT ATCAGTCGTA CACCAAATGG 840 

AGTTGCGATT CCGCATGGCG ACCATTACCA CTTTATTCCT TACAGCAAGC TTTCTGCCTT 900 

AGAAGAAAAG ATTGCCAGAA TGGTGCCTAT CAGTGGAACT GGTTCTACAG TTTCTACAAA 960 

TGCAAAACCT AATGAAGTAG TGTCTAGTCT AGGCAGTCTT TCAAGCAATC CTTCTTCTTT 1020 

AACGACAAGT AAGGAGCTCT CTTCAGCATC TGATGGTTAT ATTTTTAATC CAAAAGATAT 1080 

CGTTGAAGAA ACGGCTACAG CTTATATTGT AAGACATGGT GATCATTTCC ATTACATTCC 1140 

AAAATCAAAT CAAATTGGGC AACCGACTCT TCCAAACAAT AGTCTAGCAA CACCTTCTCC 1200 

ATCTCTTCCA ATCAATCCAG GAACTTCACA TGAGAAACAT GAAGAAGATG GATACGGATT 1260 

TGATGCTAAT CGTATTATCG CTGAAGATGA ATCAGGTTTT GTCATGAGTC ACGGAGACCA 1320 

CAATCATTAT TTCTTCAAGA AG 1342 
(2) INFORMATION FOR SEQ ID NO: 182: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 447 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182: 

Leu Asn Gin His Arg iSer Gin Glu Asn Lys Asp Asn Asn Arg Val Ser 
15 10 15 

Tyr Val Asp Gly Ser Gin Ser Ser Gin Lys Ser Glu Asn Leu Thr Pro 
20 25 30 

Asp Gin Val Ser Gin Lys Glu Gly He Gin Ala Glu Gin He Val He 
35 40 45 

Lys He Thr Asp Gin Gly Tyr Val Thr Ser His Gly Asp His Tyr His 
50 55 60 

Tyr Tyr Asn Gly Lys Val Pro Tyr Asp Ala Leu Phe Ser Glu Glu Leu 
65 70 75 80 

Leu Met Lys Asp Pro Asn Tyr Gin Leu Lys Asp Ala Asp He Val Asn 
85 90 95 

Glu Val Lys Gly Gly Tyr He He Lys Val Asp Gly Lys Tyr Tyr Val 
100 105 110 

Tyr Leu Lys Asp Ala Ala His Ala Asp Asn Val Arg Thr Lys Asp Glu 
115 120 125 

He Asn Arg Gin Lys Gin Glu His Val Lys Asp Asn Glu Lys Val Asn 
130 135 140 
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Ser Asn Val Ala Val Ala Arg Ser Gin Gly Arg Tyr Thr Thr Asn Asp 
145 150 155 160 

Gly Tyr Val Phe Asn Pro Ala Asp lie lie Glu Asp Thr Gly Asn Ala 
165 170 175 

Tyr lie Val Pro His Gly Gly His Tyr His Tyr lie Pro Lys Ser Asp 
180 185 190 

Leu Ser Ala Ser Glu Leu Ala Ala Ala Lys Ala His Leu Ala Gly Lys 
195 200 205 

Asn Met Gin Pro Ser Gin Leu Ser Tyr Ser Ser Thr Ala Ser Asp Asn 
210 215 220 

Asn Thr Gin Ser Val Ala Lys Gly Ser Thr Ser Lys Pro Ala Asn Lys 
225 230 235 240 

Ser Glu Asn Leu Gin Ser Leu Leu Lys Glu Leu Tyr Asp Ser Pro Ser 
245 250 255 

Ala Gin Arg Tyr Ser Glu Ser Asp Gly Leu Val Phe Asp Pro Ala Lys 
260 265 270 

lie lie Ser Arg Thr Pro Asn Gly Val Ala lie Pro His Gly Asp His 
275 280 285 

Tyr His Phe lie Pro Tyr Ser Lys Leu Ser Ala Leu Glu Glu Lys lie 
290 295 300 

Ala Arg Met Val Pro lie Ser Gly Thr Gly Ser Thr Val Ser Thr Asn 
305 310 315 320 

Ala Lys Pro Asn Glu Val Val Ser Ser Leu Gly Ser Leu Ser Ser Asn 
325 330 335 

Pro Ser Ser Leu Thr Thr Ser Lys Glu Leu Ser Ser Ala Ser Asp Gly 
340 345 350 

Tyr lie Phe Asn Pro Lys Asp lie Val Glu Glu Thr Ala Thr Ala Tyr 
355 360 365 

lie Val Arg His Gly Asp His Phe His Tyr lie Pro Lys Ser Asn Gin 
370 375 380 

lie Gly Gin Pro Thr Leu Pro Asn Asn Ser Leu Ala Thr Pro Ser Pro 
385 390 395 400 

Ser Leu Pro lie Asn Pro Gly Thr Ser His Glu Lys His Glu Glu Asp 
405 410 415 

Gly Tyr Gly Phe Asp Ala Asn Arg lie lie Ala Glu Asp Glu Ser Gly 
420 425 430 

Phe Val Met Ser His Gly Asp His Asn His Tyr Phe Phe Lys Lys 
435 440 445 

INFORMATION FOR SEQ ID NO: 1831 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 934 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183: 



TGACTACCTT 


GAAATCCCAC 


TTTACAGCTA 


TCTTGGTGGA 


TTCAACACTA 


AAGTTCTTCC 


60 


AACTCCAATG 


ATGAACATCA 


TCAACGGTGG 


TTCTCACTCT 


GACGCTCCAA 


TCGCTTTCCA 


120 


AGAGTTCATG 


ATCTTGCCAG 


TTGGTGCGCC 


AACATTTAAA 


GAAGCCCTTC 


GTTACGGTGC 


180 


TGAAATCTTC 


CACGCTCTTA 


AGAAAATCCT 


TAAATCACGT 


GGTTTGGAAA 


CTGCCGTAGG 


240 


TGACGAAGGT 


GGATTCGCTC 


CTCGTTTCGA 


AGGAACTGAA 


GATGGTGTTG 


AAACTATCCT 


300 


TGCTGCGATT 


GAAGCTGCTG 


GATATGTACC 


AGGTAAAGAC 


GTATTTATCG 


GATTTGACTG 


360 


TGCTTCATCA 


GAATTCTACG 


ATAAAGAACG 


TAAAGTTTAC 


GACTACACTA 


AATTTGAAGG 


420 


TGAAGGTGCT 


GCTGTTCGTA 


CATCTGCAGA 


ACAAATCGAC 


TACCTTGAAG 


AATTGGTTAA 


480 


CAAATACCCA 


ATCATCACTA 


TTGAAGATGG 


TATGGATGAA 


AACGACTGGG 


ATGGTTGGAA 


540 


AGCTCTTACT 


GAACGTCTTG 


GTAAGAAAGT 


ACAACTTGTT 


GGTGACGACT 


TCTTCGTAAC 


600 


AAACACTGAC 


TACCTTGCAC 


GTGGTATCCA 


AGAAGGTGCT 


GCTAACTCAA 


TCCTTATCAA 


660 


AGTTAACCAA 


ATCGGTACTC 


TTACTGAAAC 


TTTTGAAGCT 


ATCGAAATGG 


CTAAAGAAGC 


720 


TGGTTACACT 


GCTGTTGTAT 


CACACCGTTC 


AGGTGAAACT 


GAAGATTCAA 


CAATCGCTGA 


780 


TATTGCAGTT 


GCAACTAACG 


CAGGACAAAT 


CAAGACTGGT 


TCACTTTCAC 


GTACAGACCG 


840 


CATCGCTAAA 


TACAACCAAT 


TGCTTCGTAT 


CGAAGACCAA 


CTTGGTGAAG 


TAGCTGAATA 


900 


TCGTGGATTG 


AAATCATTCT 


ACAACCTTAA 


AAAA 
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(2) INFORMATION FOR SEQ ID NO: 184: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 311 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184: 

Asp Tyr Leu Glu lie Pro Leu Tyr Ser Tyr Leu Gly Gly Phe Asn Thr 
15 10 15 

Lys Val Leu Pro Thr Pro Met Met Asn lie lie Asn Gly Gly Ser His 
20 25 30 

Ser Asp Ala Pro He Ala Phe Gin Glu Phe Met He Leu Pro Val Gly 
35 40 45 

Ala Pro Thr Phe Lys Glu Ala Leu Arg Tyr Gly Ala Glu He Phe His 
50 55 60 

Ala Leu Lys Lys He Leu Lys Ser Arg Gly Leu Glu Thr Ala Val Gly 
65 70 75 80 
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Asp Glu Gly Gly Phe Ala Pro Arg Phe Glu Gly Thr Glu Asp Gly Val 

85 90 95 ; 

• .-.^ 

Glu Thr lie Leu Ala Ala lie Glu Ala Ala Gly Tyr Val Pro Gly Lys ''I 
100 105 110 

Asp Val Phe lie Gly Phe Asp Cys Ala Ser Ser Glu Phe Tyr Asp Lys 
115 120 125 

Glu Arg Lys Val Tyr Asp Tyr Thr Lys Phe Glu Gly Glu Gly Ala Ala 
130 135 140 

Val Arg Thr Ser Ala Glu Gin lie Asp Tyr Leu Glu Glu Leu Val Asn 
145 150 155 160 

Lys Tyr Pro lie lie Thr lie Glu Asp Gly Met Asp Glu Asn Asp Trp 
165 170 175 

Asp Gly Trp Lys Ala Leu Thr Glu Arg Leu Gly Lys Lys Val Gin Leu 
180 185 190 

Val Gly Asp Asp Phe Phe Val Thr Asn Thr Asp Tyr Leu Ala Arg Gly 
195 200 205 

lie Gin Glu Gly Ala Ala Asn Ser lie Leu lie Lys Val Asn Gin lie 
210 215 220 

Gly Thr Leu Thr Glu Thr Phe Glu Ala lie Glu Met Ala Lys Glu Ala 
225 230 235 240 

Gly Tyr Thr Ala Val Val Ser His Arg Ser Gly Glu Thr Glu Asp Ser 
245 250 255 

Thr lie Ala Asp lie Ala Val Ala Thr Asn Ala Gly Gin lie Lys Thr 
260 265 270 

Gly Ser Leu Ser Arg Thr Asp Arg lie Ala Lys Tyr Asn Gin Leu Leu 
275 280 285 

Arg lie Glu Asp Gin Leu Gly Glu Val Ala Glu Tyr Arg Gly Leu Lys 
290 295 300 

Ser Phe Tyr Asn Leu Lys Lys 
305 310 

(2) INFORMATION FOR SEQ ID NO: 185: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 541 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185: 

TCGTATCTTT TTTTGGAGCA ATGTTCGCGT AGAAGGACAT TCCATGGATC CGACCCTAGC 60 

GGATGGCGAA ATTCTCTTCG TTGTAAAACA CCTTCCTATT GACCGTTTTG ATATCGTGGT 120 

GGCCCATGAG GAAGATGGCA ATAAGGACAT CGTCAAGCGC GTGATTGGAA TGCCTGGCGA 180 
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CACCATTCGT TACGAAAATG ATAAACTCTA CATCAATGAC AAAGAAACGG ACGAGCCTTA 



240 



TCTAGCAGAC TATATCAAAC GCTTCAAGGA TGACAAACTC CAAAGCACTT ACTCAGGCAA 



300 



GGGCTTTGAA GGAAATAAAG GAACTTTCTT TAGAAGTATC GCTCAAAAAG CTCAAGCCTT 



360 



CACAGTTGAT GTCAACTACA ACACCAACTT TAGCTTTACT GTTCCAGAAG GAGAATACCT 



420 



TCTCCTCGGA GATGACCGCT TGGTTTCGAG CGACAGCCGC CACGTAGGTA CCTTCAAAGC 



480 



AAAAGATATC ACAGGGGAAG CTAAATTCCG CTTATGGCCA ATCACCCGTA TCGGAACATT 



540 



T 



541 



(2) INFORMATION FOR SEQ ID NO: 186: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 180 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186: 

Arg lie Phe Phe Trp Ser Asn Val Arg Val Glu Gly His Ser Met Asp 
15 10 15 

Pro Thr Leu Ala Asp Gly Glu lie Leu Phe Val Val Lys His Leu Pro 
20 25 30 

lie Asp Arg Phe Asp lie Val Val Ala His Glu Glu Asp Gly Asn Lys 
35 40 45 

Asp He Val Lys Arg Val He Gly Met Pro Gly Asp Thr He Arg Tyr 
50 55 60 

Glu Asn Asp Lys Leu Tyr He Asn Asp Lys Glu Thr Asp Glu Pro Tyr 
65 70 75 80 

Leu Ala Asp Tyr He Lys Arg Phe Lys Asp Asp Lys Leu Gin Ser Thr 
85 90 95 

Tyr Ser Gly Lys Gly Phe Glu Gly Asn Lys Gly Thr Phe Phe Arg Ser 
100 105 110 

He Ala Gin Lys Ala Gin Ala Phe Thr Val Asp Val Asn Tyr Asn Thr 
115 120 125 

Asn Phe Ser Phe Thr Val Pro Glu Gly Glu Tyr Leu Leu Leu Gly Asp 
130 135 140 

Asp Arg Leu Val Ser Ser Asp Ser Arg His Val Gly Thr Phe Lys Ala 
145 150 155 160 

Lys Asp He Thr Gly Glu Ala Lys Phe Arg Leu Trp Pro He Thr Arg 
165 170 175 



He Gly Thr Phe 
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(2) INFORMATION FOR SEQ ID NO: 187: 



178 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187: 

GGACTCTCTC AAAGATGTGA AAGCAAATGC TAGCGACAGC AAGCCTGCAC AGGACAAGAA 60 

GGATGCAAAA CAAGGAACGG AAGATAGTAA GGATTCAGAT AAGATGACTG AAACAAACTC 120 

AGTTCCGGCA GGAGTGATTG TGGTCAGTCT ACTTGCCCTC CTAGGCGTGA TTGCCTTCTG 180 

GCTGATTCGC CGTAAGAAAG AGTCAGAAAT CCAGCAATTA AGCACGGAAT TGATCAAGGT 240 

TCTAGGACAG CTAGATGCAG AAAAAGCGGA TAAAAAAGTC CTTGCCAAAG CCCAAAACCT 300 

TCTCCAAGAA ACCCTTGATT TCGTGAAAGA AGAAAATGGC TCAGCAGAGA CAGAAACTAA 360 

ACTAGTAGAG GAGCTTAAAG CAATCCTTGA CAAACTCAAG 400 
(2) INFORMATION FOR SEQ ID NO: 188: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 133 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 188: 

Asp Ser Leu Lys Asp Val Lys Ala Asn Ala Ser Asp Ser Lys Pro Ala 
15 10 15 

Gin Asp Lys Lys Asp Ala Lys Gin Gly Thr Glu Asp Ser Lys Asp Ser 
20 25 30 

Asp Lys Met Thr Glu Thr Asn Ser Val Pro Ala Gly Val He Val Val 
35 40 45 

Ser Leu Leu Ala Leu Leu Gly Val He Ala Phe Trp Leu He Arg Arg 
50 55 60 

Lys Lys Glu Ser Glu He Gin Gin Leu Ser Thr Glu Leu He Lys Val 
65 70 75 80 

Leu Gly Gin Leu Asp Ala Glu Lys Ala Asp Lys Lys Val Leu Ala Lys 
85 90 95 

Ala Gin Asn Leu Leu Gin Glu Thr Leu Asp Phe Val Lys Glu Glu Asn 
100 105 110 

Gly Ser Ala Glu Thr Glu Thr Lys Leu Val Glu Glu Leu Lys Ala He 
115 120 125 

Leu Asp Lys Leu Lys 
130 
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(2) INFORMATION FOR SEQ ID NO: 189: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1201 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189: 



CAAGAAATCC 


TATCATCTCT 


TCCAGAAGCA 


AACAGAGACG 


AGGGGAATTC 


AGACTCAGTT 


60 


GATTGAAGAA 


TCGCTTAGTC 


AGCAGACTAT 


AATCCAGTCC 


TTCAATGCTC 


AAACAGAATT 


120 


TATCCAAAGA 


TTGCGTGAGG 


CTCATGACAA 


CTACTCAGGC 


TATTCTCAGT 


CAGCCATCTT 


180 


TTATTCTTCA 


ACGGTCAATC 


CTTCGACTCG 


CTTTGTAAAT 


GCACTCATTT 


ATGCCCTTTT 


240 


AGCTGGAGTA 


GGAGCTTATC 


GTATCATGAT 


GGGTTCAGCC 


TTGACCGTCG 


GTCGTTTAGT 


300 


GACTTTTTTG 


AACTATGTTC 


AGCAATACAC 


CAAGCCCTTT 


AACGATATTT 


CTTCAGTGCT 


360 


AGCTGAGTTG 


CAAAGTGCTC 


TGGCTTGCGT 


AGAGCGTATC 


TATGGAGTCT 


TAGATAGCCC 


420 


TGAAGTGGCT 


GAAACAGGTA 


AGGAAGTCTT 


GACGACCAGT 


GACCAAGTTA 


AGGGAGCTAT 


480 


TTCCTTTAAA 


CATGTCTCTT 


TTGGCTACCA 


TCCTGAAAAA 


ATTTTGATTA 


AGGACTTGTC 


540 


TATCGATATT 


CCAGCTGGTA 


GTAAGGTAGC 


CATCGTTGGT 


CCGACAGGTG 


CTGGAAAATC 


600 


AACTCTTATC 


AATCTCCTTA 


TGCGTTTTTA 


TCCCATTAGC 


TCGGGAGATA 


TCTTGCTGGA 


660 


TGGGCAATCC 


ATTTATGATT 


ATACACGAGT 


ATCATTGAGA 


CAGCAGTTTG 


GTATGGTGCT 


720 


TCAAGAAACC 


TGGCTCACAC 


AAGGGACCAT 


TCATGATAAT 


ATTGCCTTTG 


GCAATCCTGA 


780 


AGCCAGTCGA 


GAGCAAGTAA 


TTGCTGCTGC 


CAAAGCAGCT 


AATGCAGACT 


TTTTCATCCA 


840 


ACAGTTGCCA 


CAGGGATACG 


ATACCAAGTT 


GGAAAATGCT 


GGAGAATCTC 


TCTCTGTCGG 


900 


CCAAGCTCAG 


CTCTTGACCA 


TAGCCCGAGT 


CTTTCTGGCT 


ATTCCAAAGA 


TTCTTATCTT 


960 


AGACGAGGCA 


ACTTCTTCCA 


TTGATACACG 


GACAGAAGTG 


CTGGTACAGG 


ATGCCTTTGC 


1020 


AAAACTCATG 


AAGGGCCGCA 


CAAGTTTCAT 


CATTGCTCAC 


CGTTTGTCAA 


CCATTCAGGA 


1080 


TGCGGATTTA 


ATTCTTGTCT 


TAGTAGATGG 


TGATATTGTT 


GAATATGGTA 


ACCATCAAGA 


1140 


ACTCATGGAT 


AGAAAGGGTA 


AGTATTACCA 


AATGCAAAAA 


GCTGCGGCTT 


TTAGTTCTGA 


1200 



A 1201 
(2) INFORMATION FOR SEQ ID NO: 190: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:190: 

Lys Lys Ser Tyr His Leu Phe Gin Lys Gin Thr Glu Thr Arg Gly lie 
15 10 15 

Gin Thr Gin Leu lie Glu Glu Ser Leu Ser Gin Gin Thr lie lie Gin 
20 25 30 

Ser Phe Asn Ala Gin Thr Glu Phe lie Gin Arg Leu Arg Glu Ala His 
35 40 45 

Asp Asn Tyr Ser Gly Tyr Ser Gin Ser Ala lie Phe Tyr Ser Ser Thr 
50 55 60 

Val Asn Pro Ser Thr Arg Phe Val Asn Ala Leu lie Tyr Ala Leu Leu 
65 70 75 80 

Ala Gly Val Gly Ala Tyr Arg lie Met Met Gly Ser Ala Leu Thr Val 
85 90 95 

Gly Arg Leu Val Thr Phe Leu Asn Tyr Val Gin Gin Tyr Thr Lys Pro 
100 105 110 

Phe Asn Asp lie Ser Ser Val Leu Ala Glu Leu Gin Ser Ala Leu Ala 
115 120 125 

Cys Val Glu Arg lie Tyr Gly Val Leu Asp Ser Pro Glu Val Ala Glu 
130 135 140 

Thr Gly Lys Glu Val Leu Thr Thr Ser Asp Gin Val Lys Gly Ala lie 
145 150 155 160 

Ser Phe Lys His Val Ser Phe Gly Tyr His Pro Glu Lys lie Leu lie 
165 170 175 

Lys Asp Leu Ser lie Asp lie Pro Ala Gly Ser Lys Val Ala lie Val 
180 185 190 

Gly Pro Thr Gly Ala Gly Lys Ser Thr Leu lie Asn Leu Leu Met Arg 
195 200 205 

Phe Tyr Pro lie Ser Ser Gly Asp lie Leu Leu Asp Gly Gin Ser lie 
210 215 220 

Tyr Asp Tyr Thr Arg Val Ser Leu Arg Gin Gin Phe Gly Met Val Leu 
225 230 235 240 

Gin Glu Thr Trp Leu Thr Gin Gly Thr lie His Asp Asn lie Ala Phe 
245 250 255 

Gly Asn Pro Glu Ala Ser Arg Glu Gin Val lie Ala Ala Ala Lys Ala 
260 265 270 

Ala Asn Ala Asp Phe Phe lie Gin Gin Leu Pro Gin Gly Tyr Asp Thr 
275 280 285 

Lys Leu Glu Asn Ala Gly Glu Ser Leu Ser Val Gly Gin Ala Gin Leu 
290 295 300 

Leu Thr lie Ala Arg Val Phe Leu Ala lie Pro Lys lie Leu lie Leu 
305 310 315 320 
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Asp Glu Ala Thr Ser Ser lie Asp Thr Arg Thr Glu Val Leu Val Gin 
325 330 335 

Asp Ala Phe Ala Lys Leu Met Lys Gly Arg Thr Ser Phe lie lie Ala 
340 345 350 

His Arg Leu Ser Thr lie Gin Asp Ala Asp Leu lie Leu Val Leu Val 
355 360 365 

Asp Gly Asp lie Val Glu Tyr Gly Asn His Gin Glu Leu Met Asp Arg 
370 375 380 

Lys Gly Lys Tyr Tyr Gin Met Gin Lys Ala Ala Ala Phe Ser Ser Glu 
385 390 395 400 

(2) INFORMATION FOR SEQ ID NO: 191: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1033 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191: 



ACGAAATGCA 


GGGCAGACAG 


ATGCCTCGCA 


AATTGAAAAG 


GCGGCAGTTA 


GCCAAGGAGG 


60 


AAAAGCAGTG 


AAAAAAACAG 


AAATTAGTAA 


AGACGCAGAC 


TTGCACGAAA 


TTTATCTAGC 


120 


TGGAGGTTGT 


TTCTGGGGAG 


TGGAGGAATA 


TTTCTCACGT 


GTTCCCGGGG 


TGACGGATGC 


180 


CGTTTCAGGC 


TATGCAAATG 


GTAGAGGAGA 


AACAACCAAG 


TACGAATTGA 


TTAACCAAAC 


240 


AGGTCATGCA 


GAAACCGTCC 


ATGTCACCTA 


TGATGCCAAG 


CAAATTTCTC 


TCAAGGAAAT 


300 


CCTGCTTCAC 


TATTTCCGCA 


TTATCAATCC 


AACCAGCAAA 


AATAAACAAG 


GAAATGATGT 


360 


GGGGACCCAG 


TACCGTACTG 


GTGTTTATTA 


CACAGATGAC 


AAGGATTTGG 


AAGTGATTAA 


420 


CCAAGTCTTT 


GATGAGGTGG 


CTAAGAAATA 


CGATCAACCT 


CTAGCAGTTG 


AAAAGGAAAA 


480 


CTTGAAGAAT 


TTTGTGGTGG 


CTGAGGATTA 


CCATCAAGAC 


TATCTCAAGA 


AAAATCCAAA 


540 


TGGCTACTGC 


CATATCAATG 


TTAATCAGGC 


GGCCTATCCT 


GTCATTGATG 


CCAGCAAATA 


600 


TCCAAAACCA 


AGTGATGAGG 


AATTGAAAAA 


GACCCTGTCA 


CCTGAGGAGT 


ATGCAGTTAC 


660 


CCAGGAAAAT 


CAAACAGAAC 


GAGCTTTCTC 


AAACCGTTAC 


TGGGATAAAT 


TTGAATCCGG 


720 


TATCTATGTG 


GATATAGCAA 


CTGGGGAACC 


TCTCTTTTCA 


TCAAAAGACA 


AATTTGAGTC . 


780 


TGGTTGTGGC 


TGGCCTAGTT 


TTACCCAACC 


CATCAGTCCA 


GATGTTGTCA 


CCTACAAGGA 


840 


AGATAAGTCC 


TACAATATGA 


CGCGTATGGA 


AGTGCGGAGC 


CGAGTAGGAG 


ATTCTCACCT 


900 


TGGGCATGTC 


TTTACGGATG 


GTCCACAGGA 


CAAGGGCGGC 


TTACGTTACT 


GTATCAATAG 


960 


CCTCTCTATC 


CGCTTTATTC 


CCAAAGACCA 


AATGGAAGAA 


AAAGGCTACG 


CTTATTTACT 


1020 


AGATTATGTT 


GAT 










1033 



(2) INFORMATION FOR SEQ ID NO: 192: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 344 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192: 

Arg Asn Ala Gly Gin Thr Asp Ala Ser Gin lie Glu Lys Ala Ala Val 
15 10 15 

Ser Gin Gly Gly Lys Ala Val Lys Lys Thr Glu lie Ser Lys Asp Ala 
20 25 30 

Asp Leu His Glu lie Tyr Leu Ala Gly Gly Cys Phe Trp Gly Val Glu 
35 40 45 

Glu Tyr Phe Ser Arg Val Pro Gly Val Thr Asp Ala Val Ser Gly Tyr 
50 55 60 

Ala Asn Gly Arg Gly Glu Thr Thr Lys Tyr Glu Leu lie Asn Gin Thr 
65 70 75 80 

Gly His Ala Glu Thr Val His Val Thr Tyr Asp Ala Lys Gin lie Ser 
85 90 95 

Leu Lys Glu lie Leu Leu His Tyr Phe Arg lie lie Asn Pro Thr Ser 
100 105 110 

Lys Asn Lys Gin Gly Asn Asp Val Gly Thr Gin Tyr Arg Thr Gly Val 
115 120 125 

Tyr Tyr Thr Asp Asp Lys Asp Leu Glu Val lie Asn Gin Val Phe Asp 
130 135 140 

Glu Val Ala Lys Lys Tyr Asp Gin Pro Leu Ala Val Glu Lys Glu Asn 
145 150 155 160 

Leu Lys Asn Phe Val Val Ala Glu Asp Tyr His Gin Asp Tyr Leu Lys 
165 170 175 

Lys Asn Pro Asn Gly Tyr Cys His lie Asn Val Asn Gin Ala Ala Tyr 
180 185 190 

Pro Val lie Asp Ala Ser Lys Tyr Pro Lys Pro Ser Asp Glu Glu Leu 
195 200 205 

Lys Lys Thr Leu Ser Pro Glu Glu Tyr Ala Val Thr Gin Glu Asn Gin 
210 215 220 

Thr Glu Arg Ala Phe Ser Asn Arg Tyr Trp Asp Lys Phe Glu Ser Gly 
225 230 235 240 

lie Tyr Val Asp lie Ala Thr Gly Glu Pro Leu Phe Ser Ser Lys Asp 
245 250 255 

Lys Phe Glu Ser Gly Cys Gly Trp Pro Ser Phe Thr Gin Pro lie Ser 
260 265 270 

Pro Asp Val Val Thr Tyr Lys Glu Asp Lys Ser Tyr Asn Met Thr Arg 
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275 280 285 

Met Glu Val Arg Ser Arg Val Gly Asp Ser His Leu Gly His Val Phe 
290 295 300 

Thr Asp Gly Pro Gin Asp Lys Gly Gly Leu Arg Tyr Cys lie Asn Ser 
305 310 315 320 



Leu Ser lie Arg Phe lie Pro Lys Asp Gin Met Glu Glu Lys Gly Tyr 
325 330 335 

Ala Tyr Leu Leu Asp Tyr Val Asp 
340 

(2) INFORMATION FOR SEQ ID NO: 193: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 396 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193: 

TGTATAGTTT TTAGCGCTTG TTCTTCTAAT TCTGNTAAAA ATGAAGAAAA TACTTCTAAA 60 

GAGCATGCGC CTGATAAAAT AGTTTTAGAT CATGCTTTCG GTCAAACTAT ATTAGATAAA 120 

AAACCTGAAA GAGTTGCAAC TATTGCTTGG GGAAATCATG ATGTAGCATT AGCTTTAGGA 180 

ATAGTTCCTG TTGGATTTTC AAAAGCAAAT TACGGTGTAA GTGCTGATAA AGGAGTTTTA 240 

CCATGGACAG AAGAAAAAAT CAAAGAACTA AATGGTAAAG CTAACCTATT TGACGATTTG 300 

GATGGACTTA ACTTTGAAGC AATATCAAAT TCTAAACCAG ATGTTATCTT AGCAGGTTAT 360 

TCTGGTATAA CTAAAGAAGA TTATGACACT CTATCA 396 



(2) INFORMATION FOR SEQ ID NO: 194: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 132 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194: 

Cys lie Val Phe Ser Ala Cys Ser Ser Asn Ser Xaa Lys Asn Glu Glu 
15 10 15 

Asn Thr Ser Lys Glu His Ala Pro Asp Lys lie Val Leu Asp His Ala 
20 25 30 

Phe Gly Gin Thr lie Leu Asp Lys Lys Pro Glu Arg Val Ala Thr lie 
35 40 45 

Ala Trp Gly Asn His Asp Val Ala Leu Ala Leu Gly lie Val Pro Val 
50 55 60 
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i 



Gly Phe Ser Lys Ala Asn Tyr Gly Val Ser Ala Asp Lys Gly Val Leu 
65 70 75 80 

Pro Trp Thr Glu Glu Lys lie Lys Glu Leu Asn Gly Lys Ala Asn Leu 
85 90 95 

Phe Asp Asp Leu Asp Gly Leu Asn Phe Glu Ala lie Ser Asn Ser Lys 
100 105 110 

Pro Asp Val lie Leu Ala Gly Tyr Ser Gly lie Thr Lys Glu Asp Tyr 
115 120 125 

Asp Thr Leu Ser 
130 

(2) INFORMATION FOR SEQ ID NO: 195: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 844 base pairs 

(B) TYPE: nucleic acid 

(C) STRAJSTDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195: 



GTGTGTCGAG 


CATATTCTGA 


AGCAAACCTA 


TCAAAATATA 


GAAATTATTT 


TAGTTGATGA 


60 


CGGTTCTACG 


GATAATTCTG 


GGGAAATTTG 


TGATGCTTTT 


ATGATGCAAG 


ATAATCGTGT 


120 


GCGAGTATTG 


CATCAAGAAA 


ATAAGGGGGG 


GGCAGCACAA 


GCTAAAAATA 


TGGGGATTAG 


180 


TGTAGCTAAG 


GGAGAGTACA 


TCACGATTGT 


TGATTCAGAT 


GATATCGTAA 


AAGAAAATAT 


240 


GATTGAAACT 


CTTTATCAGC 


AAGTCCAAGA 


AAAGGATGCA 


GATGTTGTTA 


TAGGGAATTA 


300 


CTATAATTAT 


GACGAAAGTG 


ACGGGAATTT 


TTATTTTTAT 


GTAACAGGGC 


AAGATTTTTG 


360 


CGTCGAAGAA 


TTAGCTATAC 


AAGAAATTAT 


GAACCGTCAA 


GCAGGAGATT 


GGAAATTCAA 


420 


TAGCTCGGCC 


TTTATATTGC 


CGACATTTAA 


GTTGATTAAA 


AAAGAATTAT 


TCAATGAAGT 


480 


TCACTTTTCA 


AATGGTCGCC 


GCTTTGATGA 


TGAAGCAACT 


ATGCATCGCT 


TTTATCTTTT 


540 


AGCCTCTAAA 


ATCGTCTTTA 


TAAACGATAA 


TCTCTATCTG 


TATAGAAGAC 


GTTCAGGAAG 


600 


CATCATGAGA 


ACGGAATTTG 


ATCTTTCCTG 


GGCAAGAGAT 


ATTGTTGAAG 


TGTTTTCTAA 


660 


GAAAATATCG 


GATTGTGTCT 


TGGCTGGTTT 


GGATGTCTCC 


GTTCTGCGTA 


TTCGATTTGT 


720 


CAATCTTTTA 


AAAGATTATA 


AGCAAACTTT 


AGAATACCAT 


CAATTAACAG 


ATACTGAGGA 


780 


ATATAAAGAT 


ATTTGTTTCA 


GATTAAAGTT 


GTTTTTTGAT 


GCAGAACAAA 


GAAATGGTAA 


840 



AAGT 844 
(2) INFORMATION FOR SEQ ID NO: 196: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 281 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196: 

Cys Val Glu His lie Leu Lys Gin Thr Tyr Gin Asn lie Glu lie lie 
15 10 15 

Leu Val Asp Asp Gly Ser Thr Asp Asn Ser Gly Glu lie Cys Asp Ala 
20 25 30 

Phe Met Met Gin Asp Asn Arg Val Arg Val Leu His Gin Glu Asn Lys 
35 40 45 

Gly Gly Ala Ala Gin Ala Lys Asn Met Gly lie Ser Val Ala Lys Gly 
50 55 60 

Glu Tyr lie Thr lie Val Asp Ser Asp Asp lie Val Lys Glu Asn Met 
65 70 75 80 

lie Glu Thr Leu Tyr Gin Gin Val Gin Glu Lys Asp Ala Asp Val Val 
85 90 95 

lie Gly Asn Tyr Tyr Asn Tyr Asp Glu Ser Asp Gly Asn Phe Tyr Phe 
100 105 110 

Tyr Val Thr Gly Gin Asp Phe Cys Val Glu Glu Leu Ala lie Gin Glu 
115 120 125 

lie Met Asn Arg Gin Ala Gly Asp Trp Lys Phe Asn Ser Ser Ala Phe 
130 135 140 

lie Leu Pro Thr Phe Lys Leu lie Lys Lys Glu Leu Phe Asn Glu Val 
145 150 155 160 

His Phe Ser Asn Gly Arg Arg Phe Asp Asp Glu Ala Thr Met His Arg 
165 170 175 

Phe Tyr Leu Leu Ala Ser Lys lie Val Phe lie Asn Asp Asn Leu Tyr 
180 185 190 

Leu Tyr Arg Arg Arg Ser Gly Ser lie Met Arg Thr Glu Phe Asp Leu 
195 200 205 

Ser Trp Ala Arg Asp lie Val Glu Val Phe Ser Lys Lys lie Ser Asp 
210 215 220 

Cys Val Leu Ala Gly Leu Asp Val Ser Val Leu Arg lie Arg Phe Val 
225 230 235 240 

Asn Leu Leu Lys Asp Tyr Lys Gin Thr Leu Glu Tyr His Gin Leu Thr 
245 250 255 

Asp Thr Glu Glu Tyr Lys Asp lie Cys Phe Arg Leu Lys Leu Phe Phe 
260 265 270 

Asp Ala Glu Gin Arg Asn Gly Lys Ser 
275 280 

INFORMATION FOR SEQ ID NO: 197: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 811 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

( D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197: 



GTGTTTGGAT 


AGCATTCAGA 


ATCAGACGTA 


TCAAAATTTT 


GAGTGTTTAT 


TAATCAATGA 


60 


TGGCTCTCCA 


GATCATTCAT 


CCAAAATATG 


TGAAGAATTT 


GTAGAGAAAG 


ATTCTCGTTT 


120 


CAAATATTTT 


GAGAAAGCAA 


ACGGCGGTCT 


TTCATCAGCT 


CGTAACCTAG 


GTATTGAATG 


180 


TTCGGGGGGG 


GCGTACATTA 


CTTTTGTAGA 


CTCTGATGAT 


TGGTTGGAAC 


ATGATGCTTT 


240 


AGACCGATTA 


TATGGTGCTT 


TGAAAAAGGA 


AAACGCAGAT 


ATTAGTATCG 


GGCGTTATAA 


300 


TTCTTATGAT 


GAAACACGCT 


ATGTGTATAT 


GACTTATGTT 


ACGGATCCAG 


ATGATTCTCT 


360 


AGAAGTGATA 


GAAGGTAAAG 


CAATTATGGA 


TAGGGAAGGT 


GTCGAAGAAG 


TCAGAAATGG 


420 


GAACTGGACT 


GTAGCTGTCT 


TGAAGTTATT 


CAAGAGAGAG 


TTACTACAAG 


ATTTACCATT 


480 


TCCTATAGGA 


AAAATTGCAG 


AGGATACTTA 


CTGGACATGG 


AAGGTACTTC 


TAAGAGCTTC 


540 


GAGGATAGTC 


TATTTGAATC 


GTTGTGTTTA 


CTGGTACCGT 


GTTGGTTTAT 


CTGATACTTT 


600 


ATCGAATACA 


TGGAGTGAAA 


AGCGTATGTA 


TGATGAAATT 


GGGGCTAGGG 


AAGAAAAGAT 


660 


AGCTATTTTA 


GCAAGTTCAG 


ACTATGACTT 


GACCAATCAT 


ATTTTGATTT 


ATAAAAATAG 


720 


ATTACAAAGA 


GTGATAGCAA 


AATTAGAAGA 


ACAAAATATG 


CAGTTCACAG 


AGATTTACAG 


780 


AAGAATGATG 


GAAAAATTGT 


CTTTACTTCC 


G 






811 



(2) INFORMATION FOR SEQ ID NO: 198: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 270 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198: 

Cys Leu Asp Ser lie Gin Asn Gin Thr Tyr Gin Asn Phe Glu Cys Leu 
15 10 15 

Leu lie Asn Asp Gly Ser Pro Asp His Ser Ser Lys lie Cys Glu Glu 
20 25 30 

Phe Val Glu Lys Asp Ser Arg Phe Lys Tyr Phe Glu Lys Ala Asn Gly 
35 40 45 

Gly Leu Ser Ser Ala Arg Asn Leu Gly lie Glu Cys Ser Gly Gly Ala 
50 55 60 

Tyr lie Thr Phe Val Asp Ser Asp Asp Trp Leu Glu His Asp Ala Leu 
65 70 75 80 
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Asp Arg Leu Tyr Gly Ala Leu Lys Lys Glu Asn Ala Asp lie Ser lie 
85 90 95 

Gly Arg Tyr Asn Ser Tyr Asp Glu Thr Arg Tyr Val Tyr Met Thr Tyr 
100 105 110 

Val Thr Asp Pro Asp Asp Ser Leu Glu Val lie Glu Gly Lys Ala lie 
115 120 125 

Met Asp Arg Glu Gly Val Glu Glu Val Arg Asn Gly Asn Trp Thr Val 
130 135 140 

Ala Val Leu Lys Leu Phe Lys Arg Glu Leu Leu Gin Asp Leu Pro Phe 
145 150 155 160 

Pro lie Gly Lys lie Ala Glu Asp Thr Tyr Trp Thr Trp Lys Val Leu 
165 170 175 

Leu Arg Ala Ser Arg lie Val Tyr Leu Asn Arg Cys Val Tyr Trp Tyr 
180 185 190 

Arg Val Gly Leu Ser Asp Thr Leu Ser Asn Thr Trp Ser Glu Lys Arg 
195 200 205 

Met Tyr Asp Glu lie Gly Ala Arg Glu Glu Lys lie Ala lie Leu Ala 
210 215 220 

Ser Ser Asp Tyr Asp Leu Thr Asn His lie Leu lie Tyr Lys Asn Arg 
225 230 235 240 

Leu Gin Arg Val lie Ala Lys Leu Glu Glu Gin Asn Met Gin Phe Thr 
245 250 255 

Glu lie Tyr Arg Arg Met Met Glu Lys Leu Ser Leu Leu Pro 
260 265 270 

(2) INFORMATION FOR SEQ ID NO : 199: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2023 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199: 

GTGCCTAGAT AGTATTATTA CTCAAACATA TAAAAATATT GAGATTGTTG TCGTTAATGA 60 

TGGTTCTACG GATGCTTCAG GTGAAATTTG TAAAGAATTT TCAGAAATGG ATCACCGAAT 120 

TCTCTATATA GAACAAGAAA ATGCTGGTCT TTCTGCCGCA CGAAACACCG GTCTGAATAA 180 

TATGTCCGGA AATTATGTGA CCTTTGTGGA CTCGGATGAT TGGATTGAGC AAGATTATGT 240 

AGAAACTCTA TATAAAAAAA TAGTAGAGTA TCAGGCTGAT ATTGCAGTTG GTAATTATTA 300 

TTCTTTCAAC GAAAGTGAAG GAATGTTCTA CTTTCATATA TTGGGAGACT CCTATTATGA 360 

GAAAGTATAT GATAATGTTT CTATCTTTGA GAACTTGTAT GAAACTCAAG AAATGAAGAG 420 

TTTTGCTTTG ATATCTGCTT GGGGTAAACT CTATAAGGCA AGATTGTTTG AGCAGTTGCG 480 
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CTTTGACATA 


GGTAAATTAG 


GAGAAGATGG 


TTACCTCAAT 


CAAAAGGTAT 


ATTTATTATC 


540 


AGAAAAGGTA 


ATTTATTTAA 


ATAAAAGTCT 


TTATGCTTAT 


CGGATTAGAA 


AAGGTAGTTT 


600 


ATCAAGAGTT 


TGGACAGAAA 


AGTGGATGCA 


CGCTTTAGTT 


GATGCTATGT 


CTGAACGTAT 


660 


TACGCTACTA 


GCTAATATGG 


GTTATCCTCT 


AGAGAAACAC 


TTGGCAGTTT 


ATCGTCAGAT 


720 


GTTGGAAGTC 


AGTCTCGCCA 


ACGGTCAAGC 


TAGTGGTTTA 


TCTGACACAG 


CAACGTATAA 


7 80 


AGAGTTTGAA 


ATGAAACAAA 


GGCTTTTAAA 


TCAGCTATCG 


AGACAAGAGG 


AAAGTGAAAA 


840 


GAAAGCCATT 


GTCCTCGCAG 


CAAACTATGG 


CTATGTAGAC 


CAAGTTTTAA 


CGACAATCAA 


900 


GTCTATTTGT 


TATCATAATC 


GTTCGATTCG 


TTTTTATCTG 


ATTCATAGCG 


ATTTTCCAAA 


960 


TGAATGGATT 


AAGCAATTAA 


ATAAGCGCTT 


AGAGAAGTTT 


GACTCAGAAA 


TTATTAATTG 


1020 


TCGGGTAACT 


TCTGAGCAAA 


TTTCATGTTA 


TAAATCGGAT 


ATTAGTTACA 


CAGTCTTTTT 


1080 


ACGCTATTTC 


ATAGCTGATT 


TCGTGCAAGA 


AGACAAGGCC 


CTCTACTTGG 


ACTGTGATCT 


1140 


AGTTGTAACG 


AAAAATCTGG 


ATGACTTGTT 


TGCTACAGAC 


TTACAAGATT 


ATCCTTTGGC 


1200 


TGCTGTTAGA 


GATTTTGGGG 


GCAGAGCTTA 


TTTTGGTCAA 


GAAATCTTTA 


ATGCCGGTGT 


1260 


TCTCTTGGTA 


AACAATGCTT 


TTTGGAAAAA 


AGAGAATATG 


ACCCAAAAAT 


TAATTGATGT 


1320 


AACCAATGAA 


TGGCATGATA 


AGGTGGATCA 


GGCAGATCAG 


AGCATCTTGA 


ATATGCTTTT 


1380 


TGAACATAAA 


TGGTTGGAAT 


TGGACTTTGA 


TTATAATCAT 


ATTGTCATTC 


ATAAACAGTT 


1440 


TGCTGATTAT 


CAATTGCCTG 


AGGGTCAGGA 


TTATCCTGCT 


ATTATTCACT 


ATCTTTCTCA 


1500 


TCGGAAACCG 


TGGAAAGATT 


TGGCGGCCCA 


AACCTATCGT 


GAAGTTTGGT 


GGTACTATCA 


1560 


TGGGCTTGAA 


TGGACAGAAT 


TGGGACAAAA 


CCATCATTTA 


CATCCATTAC 


AAAGATCTCA 


1620 


CATCTATCCA 


ATAAAGGAAC 


CTTTCACTTG 


TCTAATCTAT 


ACTGCCTCAG 


ACCATATTGA 


1680 


ACAAATTGAG 


ACATTGGTTC 


AATCCTTGCC 


TGATATTCAG 


TTTAAGATAG 


CAGCTAGAGT 


1740 


AATAGTTAGT 


GATCGATTGG 


CTCAGATGAC 


AATTTATCCA 


AACGTGACTA 


TATTTAACGG 


1800 


AATTCACTAT 


TTGGTAGATG 


TCGATAATGA 


ATTGGTAGAA 


ACCAGTCAAG 


TACTTTTAGA 


1860 


TATTAATCAT 


GGCGAAAAGA 


CAGAAGAAAT 


TCTCGATCAA 


TTTGCTAATC 


TTGGCAAGCC 


1920 


TATCTTATCC 


TTTGAAAATA 


CTAAAACCTA 


TGAAGTAGGT 


CAGGAGGCAT 


ATGCTGTTGA 


1980 


CCAAGTTCAA 


GCAATGATTG 


AAAAATTGAG 


AGAAATAAGC 


AAA 




2023 



(2) INFORMATION FOR SEQ ID NO: 200: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 674 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQIDNO:200: 

Cys Leu Asp Ser lie lie Thr Gin Thr Tyr Lys Asn lie Glu lie Val 
1 5 10 15 

Val Val Asn Asp Gly Ser Thr Asp Ala Ser Gly Glu lie Cys Lys Glu 
20 25 30 

Phe Ser Glu Met Asp His Arg lie Leu Tyr lie Glu Gin Glu Asn Ala 
35 40 45 

Gly Leu Ser Ala Ala Arg Asn Thr Gly Leu Asn Asn Met Ser Gly Asn 
50 55 60 

Tyr Val Thr Phe Val Asp Ser Asp Asp Trp lie Glu Gin Asp Tyr Val 
65 70 75 80 

Glu Thr Leu Tyr Lys Lys lie Val Glu Tyr Gin Ala Asp lie Ala Val 
85 90 95 

Gly Asn Tyr Tyr Ser Phe Asn Glu Ser Glu Gly Met Phe Tyr Phe His 
100 105 110 

lie Leu Gly Asp Ser Tyr Tyr Glu Lys Val Tyr Asp Asn Val Ser lie 
115 120 125 

Phe Glu Asn Leu Tyr Glu Thr Gin Glu Met Lys Ser Phe Ala Leu lie 
130 135 140 

Ser Ala Trp Gly Lys Leu Tyr Lys Ala Arg Leu Phe Glu Gin Leu Arg 
145 150 155 160 

Phe Asp lie Gly Lys Leu Gly Glu Asp Gly Tyr Leu Asn Gin Lys Val 
165 170 175 

Tyr Leu Leu Ser Glu Lys Val lie Tyr Leu Asn Lys Ser Leu Tyr Ala 
180 185 190 

Tyr Arg lie Arg Lys Gly Ser Leu Ser Arg Val Trp Thr Glu Lys Trp 
195 200 205 

Met His Ala Leu Val Asp Ala Met Ser Glu Arg lie Thr Leu Leu Ala 
210 215 220 

Asn Met Gly Tyr Pro Leu Glu Lys His Leu Ala Val Tyr Arg Gin Met 
225 230 235 240 

Leu Glu Val Ser Leu Ala Asn Gly Gin Ala Ser Gly Leu Ser Asp Thr 
245 250 255 

Ala Thr Tyr Lys Glu Phe Glu Met Lys Gin Arg Leu Leu Asn Gin Leu 
260 265 270 

Ser Arg Gin Glu Glu Ser Glu Lys Lys Ala lie Val Leu Ala Ala Asn 
275 280 285 

Tyr Gly Tyr Val Asp Gin Val Leu Thr Thr lie Lys Ser lie Cys Tyr 
290 295 300 

His Asn Arg Ser lie Arg Phe Tyr Leu lie His Ser Asp Phe Pro Asn 
305 310 315 320 

Glu Trp lie Lys Gin Leu Asn Lys Arg Leu Glu Lys Phe Asp Ser Glu 
325 330 335 
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He He Asn Cys Arg Val Thr Ser Glu Gin He Ser Cys Tyr Lys Ser 
340 345 350 

Asp He Ser Tyr Thr Val Phe Leu Arg Tyr Phe He Ala Asp Phe Val 
355 360 365 

Gin Glu Asp Lys Ala Leu Tyr Leu Asp Cys Asp Leu Val Val Thr Lys 
370 375 380 

Asn Leu Asp Asp Leu Phe Ala Thr Asp Leu Gin Asp Tyr Pro Leu Ala 
385 390 395 400 

Ala Val Arg Asp Phe Gly Gly Arg Ala Tyr Phe Gly Gin Glu He Phe 
405 410 415 

Asn Ala Gly Val Leu Leu Val Asn Asn Ala Phe Trp Lys Lys Glu Asn 
420 425 430 

Met Thr Gin Lys Leu He Asp Val Thr Asn Glu Trp His Asp Lys Val 
435 440 445 

Asp Gin Ala Asp Gin Ser He Leu Asn Met Leu Phe Glu His Lys Trp 
450 455 460 

Leu Glu Leu Asp Phe Asp Tyr Asn His He Val He His Lys Gin Phe 
465 470 475 480 

Ala Asp Tyr Gin Leu Pro Glu Gly Gin Asp Tyr Pro Ala He He His 
485 490 495 

Tyr Leu Ser His Arg Lys Pro Trp Lys Asp Leu Ala Ala Gin Thr Tyr 
500 505 510 

Arg Glu Val Trp Trp Tyr Tyr His Gly Leu Glu Trp Thr Glu Leu Gly 
515 520 525 

Gin Asn His His Leu His Pro Leu Gin Arg Ser His He Tyr Pro He 
530 535 540 

Lys Glu Pro Phe Thr Cys Leu He Tyr Thr Ala Ser Asp His He Glu 
545 550 555 560 

Gin He Glu Thr Leu Val Gin Ser Leu Pro Asp He Gin Phe Lys He 
565 570 575 

Ala Ala Arg Val He Val Ser Asp Arg Leu Ala Gin Met Thr He Tyr 
580 585 590 

Pro Asn Val Thr He Phe Asn Gly He His Tyr Leu Val Asp Val Asp 
595 600 605 

Asn Glu Leu Val Glu Thr Ser Gin Val Leu Leu Asp He Asn His Gly 
610 615 620 

Glu Lys Thr Glu Glu He Leu Asp Gin Phe Ala Asn Leu Gly Lys Pro 
625 630 635 640 

He Leu Ser Phe Glu Asn Thr Lys Thr Tyr Glu Val Gly Gin Glu Ala 
645 650 655 

Tyr Ala Val Asp Gin Val Gin Ala Met He Glu Lys Leu Arg Glu He 
660 665 670 
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Ser Lys 

(2) INFORMATION FOR SEQ ID NO: 201: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 910 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 201: 



CATTCAGAAG 


CAGACCTATC 


AAAATCTGGA 


AATTATTCTT 


GTTGATGATG 


GTGCAACAGA 


60 


TGAAAGTGGT 


CGCTTGTGTG 


ATTCAATCGC 


TGAACAAGAT 


GACAGGGTGT 


CAGTGCTTCA 


120 


TAAAAAGAAC 


GAAGGATTGT 


CGCAAGCACG 


AAATGATGGG 


ATGAAGCAGG 


CTCACGGGGA 


180 


TTATCTGATT 


TTTATTGACT 


CAGATGATTA 


TATCCATCCA 


GAAATGATTC 


AGAGCTTATA 


240 


TGAGCAATTA 


GTTCAAGAAG 


ATGCGGATGT 


TTCGAGCTGT 


GGTGTCATGA 


ATGTCTATGC 


300 


TAATGATGAA 


AGCCCACAGT 


CAGCCAATCA 


GGATGACTAT 


TTTGTCTGTG 


ATTCTCAAAC 


360 


ATTTCTAAAG 


GAATACCTCA 


TAGGTGAAAA 


AATACCTGGG 


ACGATTTGCA 


ATAAGCTAAT 


420 


CAAGAGACAG 


ATTGCAACTG 


CCCTATCCTT 


TCCTAAGGGG 


TTGATTTACG 


AAGATGCCTA 


480 


TTACCATTTT 


GATTTAATCA 


AGTTGGCCAA 


GAAGTATGTG 


GTTAATACTA 


AACCCTATTA 


540 


TTACTATTTC 


CATAGAGGGG 


ATAGTATTAC 


GACCAAACCC 


TATGCAGAGA 


AGGATTTAGC 


600 


CTATATTGAT 


ATCTACCAAA 


AGTTTTATAA 


TGAAGTTGTG 


AAAAACTATC 


CTGACTTGAA 


660 


AGAGGTCGCT 


TTTTTCAGAT 


TGGCCTATGC 


CCACTTCTTT 


ATTCTGGATA 


AGATGTTGCT 


720 


AGATGATCAG 


TATAAACAGT 


TTGAAGCCTA 


TTCTCAGATT 


CATCGTTTTT 


TAAAAGGCCA 


780 


TGCCTTTGCT 


ATTTCTAGGA 


ATCCAATTTT 


CCGTAAGGGG 


AGAAGAATTA 


GTGCTTTGGC 


840 


CCTATTCATA 


AATATTTCCT 


TATATCGATT 


CTTATTACTG 


AAAAATATTG 


AAAAATCTAA 


900 


AAAATTACAT 
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(2) INFORMATION FOR SEQ ID NO: 202: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 303 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 02: 

lie Gin Lys Gin Thr Tyr Gin Asn Leu Glu lie lie Leu Val Asp Asp 
15 10 15 

Gly Ala Thr Asp Glu Ser Gly Arg Leu Cys Asp Ser lie Ala Glu Gin 
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20 25 30 

Asp Asp Arg Val Ser Val Leu His Lys Lys Asn Glu Gly Leu Ser Gin 
35 40 45 

Ala Arg Asn Asp Gly Met Lys Gin Ala His Gly Asp Tyr Leu lie Phe 
50 55 60 

lie Asp Ser Asp Asp Tyr lie His Pro Glu Met lie Gin Ser Leu Tyr 
65 70 75 80 

Glu Gin Leu Val Gin Glu Asp Ala Asp Val Ser Ser Cys Gly Val Met 
85 90 95 

Asn Val Tyr Ala Asn Asp Glu Ser Pro Gin Ser Ala Asn Gin Asp Asp 
100 105 110 

Tyr Phe Val Cys Asp Ser Gin Thr Phe Leu Lys Glu Tyr Leu lie Gly 
115 120 125 

Glu Lys lie Pro Gly Thr lie Cys Asn Lys Leu lie Lys Arg Gin lie 
130 135 140 

Ala Thr Ala Leu Ser Phe Pro Lys Gly Leu lie Tyr Glu Asp Ala Tyr 
145 150 155 160 

Tyr His Phe Asp Leu lie Lys Leu Ala Lys Lys Tyr Val Val Asn Thr 
165 170 175 

Lys Pro Tyr Tyr Tyr Tyr Phe His Arg Gly Asp Ser lie Thr Thr Lys 
180 185 190 

Pro Tyr Ala Glu Lys Asp Leu Ala Tyr lie Asp lie Tyr Gin Lys Phe 
195 200 205 

Tyr Asn Glu Val Val Lys Asn Tyr Pro Asp Leu Lys Glu Val Ala Phe 
210 215 220 

Phe Arg Leu Ala Tyr Ala His Phe Phe lie Leu Asp Lys Met Leu Leu 
225 230 235 240 

Asp Asp Gin Tyr Lys Gin Phe Glu Ala Tyr Ser Gin He His Arg Phe 
245 250 255 

Leu Lys Gly His Ala Phe Ala He Ser Arg Asn Pro He Phe Arg Lys 
260 265 270 

Gly Arg Arg He Ser Ala Leu Ala Leu Phe He Asn He Ser Leu Tyr 
275 280 285 

Arg Phe Leu Leu Leu Lys Asn He Glu Lys Ser Lys Lys Leu His 
290 295 300 

INFORMATION FOR SEQ ID NO: 203: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1972 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 203: 
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TAAGGCTGAT AATCGTGTTC AAATGAGAAC GACGATTAAT AATGAATCGC CATTGTTGCT 60 

TTCTCCGTTG TATGGCAATG ATAATGGTAA CGGATTATGG TGGGGGAACA CATTGAAGGG 120 

AGCATGGGAA GCTATTCCTG AAGATGTAAA GCCATATGCA GCGATTGAAC TTCATCCTGC 180 

AAAAGTCTGT AAACCAACAA GTTGTATTCC ACGAGATACG AAAGAATTGA GAGAATGGTA 240 

TGTCAAGATG TTGGAGGAAG CTCAAAGTCT AAACATTCCA GTTTTCTTGG TTATTATGTC 300 

GGCTGGAGAG CGTAATACAG TTCCTCCAGA GTGGTTAGAT GAACAATTCC AAAAGTATAG 360 

TGTGTTAAAA GGTGTTTTAA ATATTGAGAA TTATTGGATT TACAATAACC AGTTAGCTCC 420 

GCATAGTGCT AAATATTTGG AAGTTTGTGC CAAATATGGA GCGCATTTTA TCTGGCATGA 480 

TCATGAAAAA TGGTTCTGGG AAACTATTAT GAATGATCCG ACATTCTTTG AAGCGAGTCA 540 

AAAATATCAT AAAAATTTGG TGTTGGCAAC TAAAAATACG CCAATAAGAG ATGATGCGGG 600 

TACAGATTCT ATCGTTAGTG GATTTTGGTT GAGTGGCTTA TGTGATAACT GGGGCTCATC 660 

AACAGATACA TGGAAATGGT GGGAAAAACA TTATACAAAC ACATTTGAAA CTGGAAGAGC 720 

TAGGGATATG AGATCCTATG CATCGGAACC AGAATCAATG ATTGCTATGG AAATGATGAA 780 

TGTATATACT GGGGGAGGCA CAGTTTATAA TTTCGAATGT GCCGCGTATA CATTTATGAC 840 

AAATGATGTA CCAACTCCAG CATTTACTAA AGGTATTATT CCTTTCTTTA GACATGCTAT 900 

ACAAAATCCA GCTCCAAGTA AGGAAGAAGT TGTAAATAGA ACAAAAGCTG TATTTTGGAA 960 

TGGAGAAGGT AGGATTAGTT CATTAAACGG ATTTTATCAA GGACTTTATT CGAATGATGA 1020 

AACAATGCCT TTATATAATA ATGGGAGATA TCATATTCTT CCTGTAATAC ATGAGAAAAT 1080 

TGATAAGGAA AAGATTTCAT CTATATTCCC TAATGCAAAA ATTTTGACTA AAAATAGTGA 1140 

GGAATTGTCT AGTAAAGTCA ACTATTTAAA CTCGCTTTAT CCAAAACTTT ATGAAGGAGA 1200 

TGGGTATGCT CAGCGTGTAG GTAATTCCTG GTATATTTAT AATAGTAATG CTAATATCAA 1260 

TAAAAATCAG CAAGTAATGT TGCCTATGTA TACTAATAAT ACAAAGTCGT TATCGTTAGA 1320 

TTTGACGCCA CATACTTACG CTGTTGTTAA AGAAAATCCA AATAATTTAC ATATTTTATT 1380 

GAATAATTAC AGGACAGATA AGACAGCTAT GTGGGCATTA TCAGGAAATT TTGATGCATC 1440 

AAAAAGTTGG AAGAAAGAAG AATTAGAGTT AGCGAACTGG ATAAGCAAAA ATTATTCCAT 1500 

CAATCCTGTA GATAATGACT TTAGGACAAC AACACTTACA TTAAAAGGGC ATACTGGTCA 1560 

TAAACCTCAG ATAAATATAA GTGGCGATAA AAATCATTAT ACTTATACAG AAAATTGGGA 1620 

TGAGAATACC CATGTTTATA CCATTACGGT TAATCATAAT GGAATGGTAG AGATGTCTAT 1680 

AAATACTGAG GGGACAGGTC CAGTCTCTTT CCCAACACCA GATAAATTTA ATGATGGTAA 1740 

TTTGAATATA GCATATGCAA AACCAACAAC ACAAAGTTCT GTAGATTACA ATGGAGACCC 1800 

TAATAGAGCT GTGGATGGTA ACAGAAATGG TAATTTTAAC TCTGGTTCGG TAACACACAC 1860 

TAGGGCAGAT AATCCCTCTT GGTGGGAAGT CGATTTGAAA AAAATGGATA AAGTTGGGCT 1920 
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TGTTAAAATT TATAATCGCA CAGATGCTGA GACTCAACGT CTATCTAATT TT 1972 
(2) INFORMATION FOR SEQ ID NO:204: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 657 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 04: 

Lys Ala Asp Asn Arg Val Gin Met Arg Thr Thr lie Asn Asn Glu Ser 
15 10 15 

Pro Leu Leu Leu Ser Pro Leu Tyr Gly Asn Asp Asn Gly Asn Gly Leu 
20 25 30 

Trp Trp Gly Asn Thr Leu Lys Gly Ala Trp Glu Ala lie Pro Glu Asp 
35 40 45 

Val Lys Pro Tyr Ala Ala lie Glu Leu His Pro Ala Lys Val Cys Lys 
50 55 60 

Pro Thr Ser Cys lie Pro Arg Asp Thr Lys Glu Leu Arg Glu Trp Tyr 
65 70 75 80 

Val Lys Met Leu Glu Glu Ala Gin Ser Leu Asn lie Pro Val Phe Leu 
85 90 95 

Val lie Met Ser Ala Gly Glu Arg Asn Thr Val Pro Pro Glu Trp Leu 
100 105 110 

Asp Glu Gin Phe Gin Lys Tyr Ser Val Leu Lys Gly Val Leu Asn lie 
115 120 125 

Glu Asn Tyr Trp lie Tyr Asn Asn Gin Leu Ala Pro His Ser Ala Lys 
130 135 140 

Tyr Leu Glu Val Cys Ala Lys Tyr Gly Ala His Phe lie Trp His Asp 
145 150 155 160 

His Glu Lys Trp Phe Trp Glu Thr lie Met Asn Asp Pro Thr Phe Phe 
165 170 175 

Glu Ala Ser Gin Lys Tyr His Lys Asn Leu Val Leu Ala Thr Lys Asn 
180 185 190 

Thr Pro lie Arg Asp Asp Ala Gly Thr Asp Ser lie Val Ser Gly Phe 
195 200 205 

Trp Leu Ser Gly Leu Cys Asp Asn Trp Gly Ser Ser Thr Asp Thr Trp 
210 215 220 

Lys Trp Trp Glu Lys His Tyr Thr Asn Thr Phe Glu Thr Gly Arg Ala 
225 230 235 240 

Arg Asp Met Arg Ser Tyr Ala Ser Glu Pro Glu Ser Met lie Ala Met 
245 250 255 
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Glu Met Met Asn Val Tyr Thr Gly Gly Gly Thr Val Tyr Asn Phe Glu 
260 265 270 

Cys Ala Ala Tyr Thr Phe Met Thr Asn Asp Val Pro Thr Pro Ala Phe 
275 280 285 

Thr Lys Gly lie lie Pro Phe Phe Arg His Ala lie Gin Asn Pro Ala 
290 295 300 

Pro Ser Lys Glu Glu Val Val Asn Arg Thr Lys Ala Val Phe Trp Asn 
305 310 315 320 

Gly Glu Gly Arg lie Ser Ser Leu Asn Gly Phe Tyr Gin Gly Leu Tyr 
325 330 335 

Ser Asn Asp Glu Thr Met Pro Leu Tyr Asn Asn Gly Arg Tyr His lie 
340 345 350 

Leu Pro Val lie His Glu Lys lie Asp Lys Glu Lys lie Ser Ser lie 
355 360 365 

Phe Pro Asn Ala Lys lie Leu Thr Lys Asn Ser Glu Glu Leu Ser Ser 
370 375 380 

Lys Val Asn Tyr Leu Asn Ser Leu Tyr Pro Lys Leu Tyr Glu Gly Asp 
385 390 395 400 

Gly Tyr Ala Gin Arg Val Gly Asn Ser Trp Tyr lie Tyr Asn Ser Asn 
405 410 415 

Ala Asn lie Asn Lys Asn Gin Gin Val Met Leu Pro Met Tyr Thr Asn 
420 425 430 

Asn Thr Lys Ser Leu Ser Leu Asp Leu Thr Pro His Thr Tyr Ala Val 
435 440 445 

Val Lys Glu Asn Pro Asn Asn Leu His lie Leu Leu Asn Asn Tyr Arg 
450 455 460 

Thr Asp Lys Thr Ala Met Trp Ala Leu Ser Gly Asn Phe Asp Ala Ser 
465 470 475 480 

Lys Ser Trp Lys Lys Glu Glu Leu Glu Leu Ala Asn Trp lie Ser Lys 
485 490 495 

Asn Tyr Ser lie Asn Pro Val Asp Asn Asp Phe Arg Thr Thr Thr Leu 
500 505 510 

Thr Leu Lys Gly His Thr Gly His Lys Pro Gin lie Asn lie Ser Gly 
515 520 525 

Asp Lys Asn His Tyr Thr Tyr Thr Glu Asn Trp Asp Glu Asn Thr His 
530 535 540 

Val Tyr Thr lie Thr Val Asn His Asn Gly Met Val Glu Met Ser lie 
545 550 555 560 

Asn Thr Glu Gly Thr Gly Pro Val Ser Phe Pro Thr Pro Asp Lys Phe 
565 570 575 

Asn Asp Gly Asn Leu Asn lie Ala Tyr Ala Lys Pro Thr Thr Gin Ser 
580 585 590 

Ser Val Asp Tyr Asn Gly Asp Pro Asn Arg Ala Val Asp Gly Asn Arg 
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595 600 605 

Asn Gly Asn Phe Asn Ser Gly Ser Val Thr His Thr Arg Ala Asp Asn 
610 615 620 

Pro Ser Trp Trp Glu Val Asp Leu Lys Lys Met Asp Lys Val Gly Leu 
625 630 635 640 

Val Lys lie Tyr Asn Arg Thr Asp Ala Glu Thr Gin Arg Leu Ser Asn 
645 650 655 

Phe 

(2) INFORMATION FOR SEQ ID NO: 205: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 811 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205: 



CTGTGGCAAT 


CAGTCAGCTG 


CTTCCAAACA 


GTCAGCTTCA 


GGAACGATTG 


AGGTGATTTC 


60 


ACGAGAAAAT 


GGCTCTGGGA 


CACGGGGTGC 


CTTCACAGAA 


ATCACAGGGA 


TTCTCAAAAA 


120 


AGACGGTGAT 


AAAAAAATTG 


ACAACACTGC 


CAAAACAGCT 


GTGATTCAAA 


ATAGTACAGA 


180 


AGGTGTTCTC 


TCAGCAGTTC 


AAGGGAATGC 


TAATGCTATC 


GGCTACATCT 


CCTTGGGATC 


240 


TTTAACGAAA 


TCTGTCAAGG 


CTTTAGAGAT 


TGATGGTGTC 


AAGGCTAGTC 


GAGACACAGT 


300 


TTTAGATGGT 


GAATACCCTC 


TTCAACGTCC 


CTTCAACATT 


GTTTGGTCTT 


CTAATCTTTC 


360 


CAAGCTAGGT 


CAAGATTTTA 


TCAGCTTTAT 


CCACTCCAAA 


CAAGGTCAAC 


AAGTGGTCAC 


420 


AGATAATAAA 


TTTATTGAAG 


CTAAAACCGA 


AACCACGGAA 


TATACAAGCC 


AACACTTATC 


480 


AGGCAAGTTG 


TCTGTTGTAG 


GTTCCACTTC 


AGTATCTTCT 


TTAATGGAAA 


AATTAGCAGA 


540 


AGCTTATAAA 


AAAGAAAATC 


CAGAAGTTAC 


GATTGATATT 


ACCTCTAATG 


GGTCTTCAGC 


600 


AGGTATTACC 


GCTGTTAAGG 


AGAAAACCGC 


TGATATTGGT 


ATGGTTTCTA 


GGGAATTAAC 


660 


TCCTGAAGAA 


GGTAAGAGTC 


TCACCCATGA 


TGCTATTGCT 


TTAGACGGTA 


TTGCTGTTGT 


720 


GGTCAATAAT 


GACAATAAGG 


CAAGCCAAGT 


CAGTATGGCT 


GAACTTGCAG 


ACGTTTTTAG 


780 


TGGCAAATTA 


ACCACCTGGG 


ACAAGATTAA 


A 
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(2) INFORMATION FOR SEQ ID NO: 206: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 270 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 206: 

Cys Gly Asn Gin Ser Ala Ala Ser Lys Gin Ser Ala Ser Gly Thr lie 
1 5 10 15 

Glu Val lie Ser Arg Glu Asn Gly Ser Gly Thr Arg Gly Ala Phe Thr 
20 25 30 

Glu lie Thr Gly lie Leu Lys Lys Asp Gly Asp Lys Lys lie Asp Asn 
35 40 45 

Thr Ala Lys Thr Ala Val lie Gin Asn Ser Thr Glu Gly Val Leu Ser 
50 55 60 

Ala Val Gin Gly Asn Ala Asn Ala lie Gly Tyr lie Ser Leu Gly Ser 
65 70 75 80 

Leu Thr Lys Ser Val Lys Ala Leu Glu lie Asp Gly Val Lys Ala Ser 
85 90 95 

Arg Asp Thr Val Leu Asp Gly Glu Tyr Pro Leu Gin Arg Pro Phe Asn 
100 105 110 

lie Val Trp Ser Ser Asn Leu Ser Lys Leu Gly Gin Asp Phe lie Ser 
115 120 125 

Phe lie His Ser Lys Gin Gly Gin Gin Val Val Thr Asp Asn Lys Phe 
130 135 140 

lie Glu Ala Lys Thr Glu Thr Thr Glu Tyr Thr Ser Gin His Leu Ser 
145 150 155 160 

Gly Lys Leu Ser Val Val Gly Ser Thr Ser Val Ser Ser Leu Met Glu 
165 170 175 

Lys Leu Ala Glu Ala Tyr Lys Lys Glu Asn Pro Glu Val Thr lie Asp 
180 185 190 

lie Thr Ser Asn Gly Ser Ser Ala Gly lie Thr Ala Val Lys Glu Lys 
195 200 205 

Thr Ala Asp lie Gly Met Val Ser Arg Glu Leu Thr Pro Glu Glu Gly 
210 215 220 

Lys Ser Leu Thr His Asp Ala lie Ala Leu Asp Gly lie Ala Val Val 
225 230 235 240 

Val Asn Asn Asp Asn Lys Ala Ser Gin Val Ser Met Ala Glu Leu Ala 
245 250 255 

Asp Val Phe Ser Gly Lys Leu Thr Thr Trp Asp Lys lie Lys 
260 265 270 

INFORMATION FOR SEQ ID NO: 207: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 805 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207: 

TTGTCAACAA CAACATGCTA CTTCTGAGGG GACGAATCAA AGGCAAAGCA GTTCAGCGAA 60 

AGTTCCATGG AAAGCTTCAT ACACCAACCT AAACAACCAG GTAAGTACAG AAGAGGTCAA 120 

ATCTCTCTTA TCAGCTCACT TGGATCCAAA TAGTGTTGAT GCATTTTTTA ATCTCGTTAA 180 

TGACTATAAT ACCATTGTCG GCTCAACTGG CTTATCAGGA GATTTCACTT CCTTTACTCA 240 

CACCGAATAC GATGTTGAGA AAATCAGTCA TCTCTGGAAT CAAAAGAAGG GCGATTTTGT 300 

TGGGACCAAC TGCCGTATCA ATAGTTATTG TCTTTTGAAA AATTCAGTCA CCATTCCAAA 360 

GCTTGAAAAG AATGACCAGT TGCTTTTCCT AGATAATGAT GCGATTGATA AAGGAAAGGT 420 

CTTTGATTCA CAAGATAAGG AAGAGTTTGA TATTCTATTT TCGAGAGTTC CAACTGAGTC 480 

AACTACAGAT GTCAAGGTTC ACGCTGAAAA GATGGAAGCA TTCTTCTCAC AATTTCAATT 540 

CAATGAAAAA GCTCGAATGC TGTCTGTAGT CTTGCACGAC AATTTGGATG GCGAGTATCT 600 

GTTTGTAGGC CACGTTGGGG TCTTAGTACC TGCTGATGAC GGTTTCTTAT TTGTAGAGAA 660 

ATTGACTTTC GAAGAGCCCT ACCAAGCGAT TAAATTTGCT AGTAAGGAAG ATTGCTACAA 720 

GTATTTGGGC ACCAAGTATG CGGATTATAC AGGCGAGGGA CTGGCTAAGC CTTTTATCAT 780 

GGATAATGAT AAGTGGGTTA AACTT 805 
(2) INFORMATION FOR SEQ ID NO: 208: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 268 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 208: 

Cys Gin Gin Gin His Ala Thr Ser Glu Gly Thr Asn Gin Arg Gin Ser 
15 10 15 

Ser Ser Ala Lys Val Pro Trp Lys Ala Ser Tyr Thr Asn Leu Asn Asn 
20 25 30 

Gin Val Ser Thr Glu Glu Val Lys Ser Leu Leu Ser Ala His Leu Asp 
35 40 45 

Pro Asn Ser Val Asp Ala Phe Phe Asn Leu Val Asn Asp Tyr Asn Thr 
50 55 60 

lie Val Gly Ser Thr Gly Leu Ser Gly Asp Phe Thr Ser Phe Thr His 
65 70 75 80 

Thr Glu Tyr Asp Val Glu Lys lie Ser His Leu Trp Asn Gin Lys Lys 
85 90 95 

Gly Asp Phe Val Gly Thr Asn Cys Arg lie Asn Ser Tyr Cys Leu Leu 
100 105 110 
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Lys Asn Ser Val Thr lie Pro Lys Leu Glu Lys Asn Asp Gin Leu Leu 
115 120 125 

Phe Leu Asp Asn Asp Ala lie Asp Lys Gly Lys Val Phe Asp Ser Gin 
130 135 140 

Asp Lys Glu Glu Phe Asp lie Leu Phe Ser Arg Val Pro Thr Glu Ser 
145 150 155 160 

Thr Thr Asp Val Lys Val His Ala Glu Lys Met Glu Ala Phe Phe Ser 
165 170 175 

Gin Phe Gin Phe Asn Glu Lys Ala Arg Met Leu Ser Val Val Leu His 
180 185 190 

Asp Asn Leu Asp Gly Glu Tyr Leu Phe Val Gly His Val Gly Val Leu 
195 200 205 

Val Pro Ala Asp Asp Gly Phe Leu Phe Val Glu Lys Leu Thr Phe Glu 
210 215 220 

Glu Pro Tyr Gin Ala lie Lys Phe Ala Ser Lys Glu Asp Cys Tyr Lys 
225 230 235 240 

Tyr Leu Gly Thr Lys Tyr Ala Asp Tyr Thr Gly Glu Gly Leu Ala Lys 
245 250 255 

Pro Phe lie Met Asp Asn Asp Lys Trp Val Lys Leu 
260 265 

(2) INFORMATION FOR SEQ ID NO: 209: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 508 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209: 

TTGTTCAGGC AAGTCCGTGA CTAGTGAACA CCAAACGAAA GATGAAATGA AGACGGAGCA 60 

GACAGCTAGT AAAACAAGCG CAGCTAAAGG GAAAGAGGTG GCTGATTTTG AATTGATGGG 120 

AGTAGATGGC AAGACCTACC GTTTATCTGA TTACAAGGGC AAGAAAGTCT ATCTCAAATT 180 

CTGGGCTTCT TGGTGTTCCA TCTGTCTGGC TAGTCTTCCA GATACGGATG AGATTGCTAA 240 

AGAAGCTGGT GATGACTATG TGGTCTTGAC AGTAGTGTCA CCAGGACATA AGGGAGAGCA 300 

ATCTGAAGCG GACTTTAAGA ATTGGTATAA GGGATTGGAT TATAAAAATC TCCCAGTCCT 360 

AGTTGACCCA TCAGGCAAAC TTTTGGAAAC TTATGGTGTC CGTTCTTACC CAACCCAAGC 420 

CTTTATAGAC AAAGAAGGCA AGCTGGTCAA AACACATCCA GGATTCATGG AAAAAGATGC 480 

AATTTTGCAA ACTTTGAAGG AATTAGCC 508 

(2) INFORMATION FOR SEQ ID NO: 210: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 169 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210: 

Cys Ser Gly Lys Ser Val Thr Ser Glu His Gin Thr Lys Asp Glu Met 
15 10 15 

Lys Thr Glu Gin Thr Ala Ser Lys Thr Ser Ala Ala Lys Gly Lys Glu 
20 25 30 

Val Ala Asp Phe Glu Leu Met Gly Val Asp Gly Lys Thr Tyr Arg Leu 
35 40 45 

Ser Asp Tyr Lys Gly Lys Lys Val Tyr Leu Lys Phe Trp Ala Ser Trp 
50 55 60 

Cys Ser lie Cys Leu Ala Ser Leu Pro Asp Thr Asp Glu lie Ala Lys 
65 70 75 80 

Glu Ala Gly Asp Asp Tyr Val Val Leu Thr Val Val Ser Pro Gly His 
85 90 95 

Lys Gly Glu Gin Ser Glu Ala Asp Phe Lys Asn Trp Tyr Lys Gly Leu 
100 105 110 

Asp Tyr Lys Asn Leu Pro Val Leu Val Asp Pro Ser Gly Lys Leu Leu 
115 120 125 

Glu Thr Tyr Gly Val Arg Ser Tyr Pro Thr Gin Ala Phe lie Asp Lys 
130 135 140 

Glu Gly Lys Leu Val Lys Thr His Pro Gly Phe Met Glu Lys Asp Ala 
145 150 155 160 

lie Leu Gin Thr Leu Lys Glu Leu Ala 
165 



(2) INFORMATION FOR SEQ ID NO: 211: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 994 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211: 

CTCGCAAATT GAAAAGGCGG CAGTTAGCCA AGGAGGAAAA GCAGTGAAAA AAACAGAAAT 60 

TAGTAAAGAC GCAGACTTGC ACGAAATTTA TCTAGCTGGA GGTTGTTTCT GGGGAGTGGA 120 

GGAATATTTC TCACGTGTTC CCGGGGTGAC GGATGCCGTT TCAGGCTATG CAAATGGTAG 180 

AGGAGAAACA ACCAAGTACG AATTGATTAA CCAAACAGGT CATGCAGAAA CCGTCCATGT 240 

CACCTATGAT GCCAAGCAAA TTTCTCTCAA GGAAATCCTG CTTCACTATT TCCGCATTAT 300 
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(2) INFORMATION FOR SEQ ID NO: 212: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 331 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212: 

Ser Gin He Glu Lys Ala Ala Val Ser Gin Gly Gly Lys Ala Val Lys 
15 10 15 

Lys Thr Glu He Ser Lys Asp Ala Asp Leu His Glu He Tyr Leu Ala 
20 25 30 

Gly Gly Cys Phe Trp Gly Val Glu Glu Tyr Phe Ser Arg Val Pro Gly 
35 40 45 

Val Thr Asp Ala Val Ser Gly Tyr Ala Asn Gly Arg Gly Glu Thr Thr 
50 55 60 

Lys Tyr Glu Leu He Asn Gin Thr Gly His Ala Glu Thr Val His Val 
65 70 75 80 

Thr Tyr Asp Ala Lys Gin He Ser Leu Lys Glu He Leu Leu His Tyr 
85 90 95 

Phe Arg He He Asn Pro Thr Ser Lys Asn Lys Gin Gly Asn Asp Val 
100 105 110 

Gly Thr Gin Tyr Arg Thr Gly Val Tyr Tyr Thr Asp Asp Lys Asp Leu 
115 120 125 

Glu Val He Asn Gin Val Phe Asp Glu Val Ala Lys Lys Tyr Asp Gin 
130 135 140 



202 



Pro Leu Ala Val Glu Lys Glu Asn Leu Lys Asn Phe Val Val Ala Glu 
145 150 155 160 

Asp Tyr His Gin Asp Tyr Leu Lys Lys Asn Pro Asn Gly Tyr Cys His 
165 170 175 

lie Asn Val Asn Gin Ala Ala Tyr Pro Val lie Asp Ala Ser Lys Tyr 
180 185 190 

Pro Lys Pro Ser Asp Glu Glu Leu Lys Lys Thr Leu Ser Pro Glu Glu 
195 200 205 

Tyr Ala Val Thr Gin Glu Asn Gin Thr Glu Arg Ala Phe Ser Asn Arg 
210 215 220 

Tyr Trp Asp Lys Phe Glu Ser Gly lie Tyr Val Asp lie Ala Thr Gly 
225 230 235 240 

Glu Pro Leu Phe Ser Ser Lys Asp Lys Phe Glu Ser Gly Cys Gly Trp 
245 250 . 255 

Pro Ser Phe Thr Gin Pro lie Ser Pro Asp Val Val Thr Tyr Lys Glu 
260 265 270 

Asp Lys Ser Tyr Asn Met Thr Arg Met Glu Val Arg Ser Arg Val Gly 
275 280 285 

Asp Ser His Leu Gly His Val Phe Thr Asp Gly Pro Gin Asp Lys Gly 
290 295 300 

Gly Leu Arg Tyr Cys lie Asn Ser Leu Ser lie Arg Phe lie Pro Lys 
305 310 315 320 

Asp Gin Met Glu Glu Lys Gly Thr Leu lie Tyr 
325 330 

(2) INFORMATION FOR SEQ ID NO: 213: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 625 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 213: 

TTGTCAGTCA GGTTCTAATG GTTCTCAGTC TGCTGTGGAT GCTATCAAAC AAAAAGGGAA 60 

ATTAGTTGTG GCAACCAGTC CTGACTATGC ACCCTTTGAA TTTCAATCAT TGGTTGATGG 120 

AAAGAACCAG GTAGTCGGTG CAGACATCGA CATGGCTCAG GCTATCGCTG ATGAACTTGG 180 

GGTTAAGTTG GAAATCTCAA GCATGAGTTT TGACAATGTT TTGACCAGTC TTCAAACTGG 240 

TAAGGCTGAC CTAGCAGTTG CAGGAATTAG TGCTACTGAC GAGAGAAAAG AAGTCTTTGA 300 

TTTTTCAATC CCATACTATG AAAACAAGAT TAGTTTCTTG GTTCGTAAGG CTGATGTGGA 360 

AAAATACAAG GATTTAACTA GCCTAGAAAG TGCTAATATT GCAGCCCAAA AAGGGACTGT 420 

TCCAGAATCA ATGGTCAAGG AACAATTGCC AAAAGTTCAA TTAACTTCCC TAACTAATAT 480 
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GGGTGAAGCA GTCAATGAAT TGCAGGCTGG AAAAATAGAT GCTGTTCATA TGGATGAGCC 540 

TGTTGCACTT AGTTATGCTG CTAAAAACGC TGGCTTAGCT GTCGCAACTG TCAGCTTGAA 600 

GATGAAGGAC GGCGACGCCA ATGCC 625 
(2) INFORMATION FOR SEQ ID NO: 214: 

( i ) SEQUENCE ' CHARACTERISTICS : 

(A) LENGTH: 208 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214: 

Cys Gin Ser Gly Ser Asn Gly Ser Gin Ser Ala Val Asp Ala lie Lys 
15 10 15 

Gin Lys Gly Lys Leu Val Val Ala Thr Ser Pro Asp Tyr Ala Pro Phe 
20 25 30 

Glu Phe Gin Ser Leu Val Asp Gly Lys Asn Gin Val Val Gly Ala Asp 
35 40 45 

lie Asp Met Ala Gin Ala lie Ala Asp Glu Leu Gly Val Lys Leu Glu 
50 . 55 60 

lie Ser Ser Met Ser Phe Asp Asn Val Leu Thr Ser Leu Gin Thr Gly 
65 . 70 75 80 

Lys Ala Asp Leu Ala Val Ala Gly lie Ser Ala Thr Asp Glu Arg Lys 
85 90 95 

Glu Val Phe Asp Phe Ser lie Pro Tyr Tyr Glu Asn Lys lie Ser Phe 
100 105 110 

Leu Val Arg Lys Ala Asp Val Glu Lys Tyr Lys Asp Leu Thr Ser Leu 
115 120 125 

Glu Ser Ala Asn lie Ala Ala Gin Lys Gly Thr Val Pro Glu Ser Met 
130 135 140 

Val Lys Glu Gin Leu Pro Lys Val Gin Leu Thr Ser Leu Thr Asn Met 
145 150 155 160 

Gly Glu Ala Val Asn Glu Leu Gin Ala Gly Lys He Asp Ala Val His 
165 170 175 

Met Asp Glu Pro Val Ala Leu Ser Tyr Ala Ala Lys Asn Ala Gly Leu 
180 185 190 

Ala Val Ala Thr Val Ser Leu Lys Met Lys Asp Gly Asp Ala Asn Ala 
195 200 205 

(2) INFORMATION FOR SEQ ID NO: 215: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3022 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215: 
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TV TV m TV /^mmm A 

AAT AC TTT AC 


A A mo A A oomm 

AA i L AAGC i 1 


OAOA AmAOAm 

C ACAA i AC A 1 


OA A AOAOAOO 

C AAAC AGAC C 


O O Q A 


TTTGACTTTT 


TV TV TV ^^immm/^ TV 

AAAGTTTCAG 


TV m/^ TV TV m 

CATCAGCGAT 


TV TV /^m TV o A TV m 

AAC T AC AG AT 


m/"<m/^mTV A omo 

TCTCTAAC 1 G 


A O A O A mm AAA 

ACAGAi iAAA 


O 1 ^ A 

Z J4 U 


ACTTGATGAA 


ACATATAAAG 


TV TV TV TV TV TV fTl/^ 

ATGAAAAATC 


m/^ TV TV m/^/^m 

T C C AGATGGT 


TV TV /^/~* TV TV A fTimO 

AAGC AAA i 1 G 


mmo o A o AAA m 
i iCCAGAAAi 


O /I A A 

Z4U U 


m TV /^/^ TV TV TV 

TCACCCAGAA 


TV TV TV /^m/^ TV TV TV 

AAAGTCAAAG 


O A OO A A A m A m 

CjAGC AAA i A i 


o A o A mmmo a o 
CACAi i iGAG 


o A mo A m A omm 
CAiGAi AC J. I 


mo AomAmAOO 
iCAC iAlAGG 


O /I C A 

Z4o U 


CGCAAATTCT 


AGCTTTGATT 


mo A A moooom 


rPA rp A A A TiOTirn 

lAiAAAiGi i 


oo AO A r*r*r*r*7\. 
GGAGAGGCCA 


AAAAOAAAAA 

AAAACAAAAA 


Z 3Z U 


TAAATTTGTA 


TV TV m/~i TV mmmTV 

GAATCATTTA 


TTL A 1 1 i i CjA 


omo A omoo a a 
G i C AG 1 GGAA 


GCGA i GGAAG 


OmOm A A A OT'O 

C 1 C 1 AAAC 1 C 


O c: Q A 
Z Do U 


T^ /^/^^^^^/^TV TV 

CAGCGGGAAG 


TV TV TV TV m TV TV TV m 

AAAATAAACT 


moo A A oommo 

TLCAALC i iC 


X i iGiCGAiG 


oomomA A moo 
CC i C i AA i GG 


o A mmmoomoo 
GA 1 1 J. GC 1 GG 


Z d4U 


Tl TV mm/^/"* Tl Tl 

GAATTGGAAC 


/^TVOOTV TV/^OTV A 

CACGAACCAA 


moommo a m a a 
1 LC i i (jjA 1 AA 


A moooommoo 
AIGGGC i iGG 


O A A O A A OOOm 

GAAGAAGGG 1 


O A A O A rpO AAA 

C AAGA i C AAA 


OTA A 
Z / UU 


T% TV TV ^^m/^/^/^TV 

AACACTGGGA 


>^/^mm TV m/^ tv m/^ 

GGTTATGATG 


AmoAmoomA a 

AT(jjATCj(j i AA 


A ooo A A A A mm 
ACCGAAAAi i 


OOAOOA AOOm 

CCAGGAACC 1 


mA A AmA AOOO 

iAAAiAAGGG 


O T C A 

z / b u 


TV TV JTlrTT/*^/*im^*^/^ TV 

AATtGGTGGA 


TV Tl Tl rn/~i/^m*A 

GAACATGGTA 


m TV TV m TV A A mm 

TAG AT AAA IT 


mA AmOOAOOA 

T AA i C C AGC A 


OOA omm A m A o 
GGAGi iAlAC 


AAAAmAOAAA 

AAAA i AG AAA 


O Q O A 
Z oZ U 


TV T^ m T^ TV Ti Ti TV ITl 

AGATAAAAAT 


TV TV Tl TV m/^/^/^ 

ACAACATCCC 


moo A mo A A A A ■ 

TGGA i C AAAA 


mo o A o A A mm a 
J-CCAGAAi lA 


rnrnrrioO'TirnrpO A 

1 i iGC i 1 iCA 


AmA AOOA AOO 

A J. AAC G AAGG 


O Q Q A 
Z OO U 


GATCAACGCT 


CCATCATCAA 


GTGGTTCTAA 


GATTGCTAAC 


ATTTATCCTT 


TAGATTCAAA 


2940 


TGGAAATCCT 


CAAGATGCTC 


AACTTGAAAG 


AGGATTAACA 


CCTTCTCCAC 


TTGTATTAAG 


3000 


AAGTGCAGAA 


GAAGGATTGA 


TT 








3022 


(2) INFORMATION FOR SEQ ID NO: 216: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1007 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : s ingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216: 

Glu Thr Ser Gin Asp Phe Lys Glu Lys Lys Thr Ala Val lie Lys Glu 
15 10 15 

Lys Glu Val Val Ser Lys Asn Pro Val lie Asp Asn Asn Thr Ser Asn 
20 25 30 

Glu Glu Ala Lys lie Lys Glu Glu Asn Ser Asn Lys Ser Gin Gly Asp 
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35 40 45 

Tyr Thr Asp Ser Phe Val Asn Lys Asn Thr Glu Asn Pro Lys Lys Glu 
50 55 60 

Asp Lys Val Val Tyr lie Ala Glu Phe Lys Asp Lys Glu Ser Gly Glu 
65 70 75 80 

Lys Ala lie Lys Glu Leu Ser Ser Leu Lys Asn Thr Lys Val Leu Tyr 
85 90 95 

Thr Tyr Asp Arg lie Phe Asn Gly Ser Ala lie Glu Thr Thr Pro Asp 
100 105 110 

Asn Leu Asp Lys lie Lys Gin lie Glu Gly lie Ser Ser Val Glu Arg 
115 120 125 

Ala Gin Lys Val Gin Pro Met Met Asn His Ala Arg Lys Glu lie Gly 
130 135 140 

Val Glu Glu Ala lie Asp Tyr Leu Lys Ser lie Asn Ala Pro Phe Gly 
145 150 155 160 

Lys Asn Phe Asp Gly Arg Gly Met Val lie Ser Asn lie Asp Thr Gly 
165 170 175 

Thr Asp Tyr Arg His Lys Ala Met Arg lie Asp Asp Asp Ala Lys Ala 
180 185 190 

Ser Met Arg Phe Lys Lys Glu Asp Leu Lys Gly Thr Asp Lys Asn Tyr 
195 200 205 

Trp Leu Ser Asp Lys lie Pro His Ala Phe Asn Tyr Tyr Asn Gly Gly 
210 215 220 

Lys lie Thr Val Glu Lys Tyr Asp Asp Gly Arg Asp Tyr Phe Asp Pro 
225 230 235 240 

His Gly Met His lie Ala Gly lie Leu Ala Gly Asn Asp Thr Glu Gin 
245 250 255 

Asp lie Lys Asn Phe Asn Gly lie Asp Gly lie Ala Pro Asn Ala Gin 
260 265 270 

lie Phe Ser Tyr Lys Met Tyr Ser Asp Ala Gly Ser Gly Phe Ala Gly 
275 280 285 

Asp Glu Thr Met Phe His Ala lie Glu Asp Ser lie Lys His Asn Val 
290 295 300 

Asp Val Val Ser Val Ser Ser Gly Phe Thr Gly Thr Gly Leu Val Gly 
305 310 315 320 

Glu Lys Tyr Trp Gin Ala lie Arg Ala Leu Arg Lys Ala Gly lie Pro 
325 330 335 

Met Val Val Ala Thr Gly Asn Tyr Ala Thr Ser Ala Ser Ser Ser Ser 
340 345 350 

Trp Asp Leu Val Ala Asn Asn His Leu Lys Met Thr Asp Thr Gly Asn 
355 360 365 

Val Thr Arg Thr Ala Ala His Glu Asp Ala lie Ala Val Ala Ser Ala 
370 375 380 
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Lys. Asn Gin Thr Val Glu Phe Asp Lys Val Asn lie Gly Gly Glu Ser 
385 390 395 400 

Phe Lys Tyr Arg Asn lie Gly Ala Phe Phe Asp Lys Ser Lys lie Thr 
405 410 415 

Thr Asn Glu Asp Gly Thr Lys Ala Pro Ser Lys Leu Lys Phe Val Tyr 
420 425 430 

lie Gly Lys Gly Gin Asp Gin Asp Leu lie Gly Leu Asp Leu Arg Gly 
435 440 445 

Lys lie Ala Val Met Asp Arg lie Tyr Thr Lys Asp Leu Lys Asn Ala 
450 455 460 

Phe Lys Lys Ala Met Asp Lys Gly Ala Arg Ala lie Met Val Val Asn 
465 470 475 480 

Thr Val Asn Tyr Tyr Asn Arg Asp Asn Trp Thr Glu Leu Pro Ala Met 
485 490 495 

Gly Tyr Glu Ala Asp Glu Gly Thr Lys Ser Gin Val Phe Ser lie Ser 
500 505 510 

Gly Asp Asp Gly Val Lys Leu Trp Asn Met lie Asn Pro Asp Lys Lys 
515 520 525 

Thr Glu Val Lys Arg Asn Asn Lys Glu Asp Phe Lys Asp Lys Leu Glu 
530 535 540 

Gin Tyr Tyr Pro lie Asp Met Glu Ser Phe Asn Ser Asn Lys Pro Asn 
545 550 555 560 

Val Gly Asp Glu Lys Glu lie Asp Phe Lys Phe Ala Pro Asp Thr Asp 
565 570 . 575 

Lys Glu Leu Tyr Lys Glu Asp lie lie Val Pro Ala Gly Ser Thr Ser 
580 585 590 

Trp Gly Pro Arg lie Asp Leu Leu Leu Lys Pro Asp Val Ser Ala Pro 
595 600 605 

Gly Lys Asn lie Lys Ser Thr Leu Asn Val lie Asn Gly Lys Ser Thr 
610 615 620 

Tyr Gly Tyr Met Ser Gly Thr Ser Met Ala Thr Pro lie Val Ala Ala 
625 630 635 640 

Ser Thr Val Leu lie Arg Pro Lys Leu Lys Glu Met Leu Glu Arg Pro 
645 650 655 

Val Leu Lys Asn Leu Lys Gly Asp Asp Lys lie Asp Leu Thr Ser Leu 
660 665 670 

Thr Lys lie Ala Leu Gin Asn Thr Ala Arg Pro Met Met Asp Ala Thr 
675 680 685 

Ser Trp Lys Glu Lys Ser Gin Tyr Phe Ala Ser Pro Arg Gin Gin Gly 
690 695 700 

Ala Gly Leu lie Asn Val Ala Asn Ala Leu Arg Asn Glu Val Val Ala 
705 710 715 720 
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Thr Phe Lys Asn Thr Asp Ser Lys Gly Leu Val Asn Ser Tyr Gly Ser 
■ 725 730 735 

lie Ser Leu Lys Glu lie Lys Gly Asp Lys Lys Tyr Phe Thr lie Lys 
740 745 750 

Leu His Asn Thr Ser Asn Arg Pro Leu Thr Phe Lys Val Ser Ala Ser 
755 760 765 

Ala lie Thr Thr Asp Ser Leu Thr Asp Arg Leu Lys Leu Asp Glu Thr 
770 775 780 

Tyr Lys Asp Glu Lys Ser Pro Asp Gly Lys Gin lie Val Pro Glu lie 
785 790 795 800 

His Pro Glu Lys Val Lys Gly Ala Asn lie Thr Phe Glu His Asp Thr 
805 810 815 

Phe Thr lie Gly Ala Asn Ser Ser Phe Asp Leu Asn Ala Val lie Asn 
820 825 830 

Val Gly Glu Ala Lys Asn Lys Asn Lys Phe Val Glu Ser Phe lie His 
835 840 845 

Phe Glu Ser Val Glu Ala Met Glu Ala Leu Asn Ser Ser Gly Lys Lys 
850 855 860 

lie Asn Phe Gin Pro Ser Leu Ser Met Pro Leu Met Gly Phe Ala Gly 
865 870 875 880 

Asn Trp Asn His Glu Pro lie Leu Asp Lys Trp Ala Trp Glu Glu Gly 
885 890 895 

Ser Arg Ser Lys Thr Leu Gly Gly Tyr Asp Asp Asp Gly Lys Pro Lys 
900 905 910 

lie Pro Gly Thr Leu Asn Lys Gly lie Gly Gly Glu His Gly lie Asp 
915 920 925 

Lys Phe Asn Pro Ala Gly Val lie Gin Asn Arg Lys Asp Lys Asn Thr 
930 935 940 

Thr Ser Leu Asp Gin Asn Pro Glu Leu Phe Ala Phe Asn Asn Glu Gly 
945 950 955 960 

lie Asn Ala Pro Ser Ser Ser Gly Ser Lys lie Ala Asn lie Tyr Pro 
965 970 975 

Leu Asp Ser Asn Gly Asn Pro Gin Asp Ala Gin Leu Glu Arg Gly Leu 
980 985 990 

Thr Pro Ser Pro Leu Val Leu Arg Ser Ala Glu Glu Gly Leu lie 
995 1000 1005 

INFORMATION FOR SEQ ID NO: 217: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1696 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217: 





PTTPAHAPTP 


PTPAATPAAT 


AAPAAATPAP 


PAPPAAPPTA 


PPAPAPAAAA 

\J VJXiV.^ XiVJXiXiXiXi 


60 




HAnAPAfiAnn 


AAPPTPPAAA 


APAAPAAPPA 


PPTAAAAPAG 


AAG AAAGTC C 


120 


A AAnnA AnA A 


PP AAA ATP 


APnTAAAAPP 


TAP TP AP PAP 


APPPTTCCTA 


AAGTAGAAGA 


180 


CrTT^A A AP A A 


P A TTP A PP A P 


A APP APPTPP 


APTTPA AP A A 


PT A PPTPP A P 


A APTTP APTP 


240 


AAA Apr* AP AP* 


PA A A A APT AC 


P APTTAAPPP 


APAAAPTPAA 


PPATPAPAPA 


AAPPAPPTPA 

xixiV^ V_ xiVJ X VJXi 


300 


Pr'a AT'P A A A A 


PTTP A A P A a P 


P APPTPA APP 


APTPPPPPP A 


APAPAAPAPP 


AAAAPPPAPP 
xi.riiiai.vjvjiv.> xvv^, \_, 


360 




PA A A APP A AP 
AAAAoV- AAV^ 


P AP A APPTPP 


TP A APA A PAP 


A APPPTPTAP 


APPAAAPAPP 






r' APTP A APTP 
^jAVj X ^ AA^ X 


P APAT APP A A 


PPPTPA APA A 


APTPTAPA AP 


PAAAAPAPPA 


*± o V 


r^aPT'PT'PA A'P 


P A ATPTATTP 
UAAiL, XAX X Vj 


A AP A APP A A A 


APTTPA A APP 


PPTPP TPTAP 


AAAAAPAAAP 


540 

~J ri \J 


aoBar'PAAPA 


P APPA APP a A 
Lj Av3V3 AAL. AA 


A A PTTP A AP A 
AAVj X X vj/\/\V_/\ 


APP APPTPA A 


PPAPTPPPPP 


PAAPAPAAPA 


600 

W U VJ 


r*r^A APAPpp A 


PP A APPPP AP 


P APTTP APPP 


APA A A APP A A 


PP APA APTTP 


PTPAAPAAPA 

V. X \_JXi.XiVjXiXiVs_JXi 


660 


P A APPPTPTA 


P APnA A AP AP 


PPA A APP APA 


APATAAAATA 


A APPPT ATTP 

r\£\\j\JJ\J X XT. X X VJ 


PTAPTAAAPA 

V_j X xi^ X xixixi.\Jxi 


720 


a PP A PT"PP A T* 


AAA APTP APT 


T A A A T A A TP A 


AATTPATA AA 


PPT APTTP AP 

OV_> X xAVjJ X X V_.xj.VJJ 


TTTPTPPTAP 


780 


T* P A T'T' A T TP T* 


AP APP A APTT 


AP AATPPTPT 


TPPAPPTPTT 

X V7Vj.n^ X O X X 


TTAPAAAPTP 


PAAAAPGTPT 


840 


P "P A TP P TTP A 


PAPPPTPTA A 


A AP APPPTPA 


PPTAA ATAPP 


PAPAPAAATA 


AAPTTAAAAP 

T\n\^ X X rTJAxixAV_. 


900 


PPPTATTP AP 


PPTPTA A APP 
X L. X AAAv^kj 


TTP A T A A A AP 


TPA ATTA A AP 


A AT APP ATTP 


PAPATPPAAA 


960 


a ar'a a appta 

AAL. AAAVj^a 1 A 


A A AP A AP ATT 
AAAIjAA^AX X 


AP APTP A TAP 
Ak- AVj X \jr\ X AVjj 


AAPTTPPPA A 


A APPTPP AAA 


PTPA APTT AP 




a a ar^rT'TT'a a 

AAAVjok^ 1 ^jAA 


a a aPTTPP AP 

AAALr X i vjL. AVa 


PTAATAPAPA 


TPPTA A AP A A 


APTP A APTT A 


APPA APP TPT 


1 ORO 

-L \J O U 


iVjAAAAAi lA 


AL* i LrV^ AAU X A 


TTO a a a a a tt 

i iijjAAAAAX X 


rr'TTP a a tt a 

Kj\3 X 1 VjAA X X A 


TPTP A A A APP 
X X vjAAA-rilj^^ 


PA AT ATTA AP 


1 1 An 


a TT"/^ a cv a r^T 

Ail AU I A^jr X 


arTT'aTAAPA 

AL. \^ Vj A 1 AA\j A 


A A AT ATTPP A 
AAA X A X X VJOr\ 


APPTPA APPT 


PTTPPT A A PT 
O X X X r\cWj X 


AT APTPTAPA 

r\ X rW^ X V^ X rluxa 


1900 

J. Z> Lf U 


a a a tt* a a a a r* 

AAAX L-AAAAV^ 


aaaar'aAAAA 

AAAAL-AAAAA 


TP A A ATP A AT 
1 ^ AAA X A-tt. X 


PAPAPPTP A A 


TTP A A AAA AP 


PAPA APA APT 


1 9fi0 


1 A i i AA i AL. 1 


r'Tar'T^T'TTa 
ijl ALj H-t- i lA 


P a P A TP A P A A 
ALiA X VjA^.^ AA 


PPTA AP A APA 
KjKj X AAL. AAl^ A 


PA A APTATA A 
AAA\— XAX Ar\ 


dcnr^c^r att 


1 "^9 0 


TAAPAAPPTA 
X AAVjAAUV- X A 


P APTAPTAP A 
vjA^j X AL. X AV^ A 


A APA AT A PAP 


PPTATPTAPA 


APTATPATTT 
X T\ X \jr\ XXX 


APPAPAPAPP 


1380 

-L J O U 


TA APPPTP A A 


PA A APTPA A A 


PTPTAPA AAA 


TPAAAATATT 


PAATTAPATP 


TTAAAAAAPT 


1440 


TGAGCTTAAA 


AATATTAAAC 


GTACAGATTT 


AATCAAATAC 


GAAAATGGAA 


AAGAAACTAA 


1500 


TGAATCACTG 


ATAACAACTA 


TTCCTGATGA 


TAAGAGCAAT 


TATTATTTAA 


AAATAACTTC 


1560 


AAATAATCAG 


AAAACTACAT 


TACTAGCTGT 


TAAAAATATA 


GAAGAAACTA 


CGGTTAACGG 


1620 


AACACCTGTA 


TATAAAGTTA 


CAGCAATCGC 


AGACAATTTA 


GTCTCTAGAA 


CTGCTGATAA 


1680 


TAAATTTGAA 


GAAGAA 










1696 



(2) INFORMATION FOR SEQ ID NO: 218: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 565 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218: 

Val Val Glu Val Glu Thr Pro Gin Ser He Thr Asn GLn Glu Gin Ala 
15 10 15 

Arg Thr Glu Asn Gin Val Val Glu Thr Glu Glu Ala Pro Lys Glu Glu 
20 25 30 

Ala Pro Lys Thr Glu Glu Ser Pro Lys Glu Glu Pro Lys Ser Glu Val 
35 40 45 

Lys Pro Thr Asp Asp Thr Leu Pro Lys Val Glu Glu Gly Lys Glu Asp 
50 55 60 

Ser Ala Glu Pro Ala Pro Val Glu Glu Val Gly Gly Glu Val Glu Ser 
65 70 75 80 

Lys Pro Glu Glu Lys Val Ala Val Lys Pro Glu Ser Gin Pro Ser Asp 
85 90 95 

Lys Pro Ala Glu Glu Ser Lys Val Glu Gin Ala Gly Glu Pro Val Ala 
100 105 110 

Pro Arg Glu Asp Glu Lys Ala Pro Val Glu Pro Glu Lys Gin Pro Glu 
115 120 125 

Ala Pro Glu Glu Glu Lys Ala Val Glu Glu Thr Pro Lys Gin Glu Glu 
130 135 140 

Ser Thr Pro Asp Thr Lys Ala Glu Glu Thr Val Glu Pro Lys Glu Glu 
145 150 155 160 

Thr Val Asn Gin Ser He Glu Gin Pro Lys Val Glu Thr Pro Ala Val 
165 170 175 

Glu Lys Gin Thr Glu Pro Thr Glu Glu Pro Lys Val Glu Gin Ala Gly 
180 185 190 

Glu Pro Val Ala Pro Arg Glu Asp Glu Gin Ala Pro Thr Ala Pro Val 
195 200 205 

Glu Pro Glu Lys Gin Pro Glu Val Pro Glu Glu Glu Lys Ala Val Glu 
210 215 220 

Glu Thr Pro Lys Pro Glu Asp Lys He Lys Gly He Gly Thr Lys Glu 
225 230 235 240 

Pro Val Asp Lys Ser Glu Leu Asn Asn Gin He Asp Lys Ala Ser Ser 
245 250 255 

Val Ser Pro Thr Asp Tyr Ser Thr Ala Ser Tyr Asn Ala Leu Gly Pro 
260 265 270 

Val Leu Glu Thr Ala Lys Gly Val Tyr Ala Ser Glu Pro Val Lys Gin 
275 280 285 

Pro Glu Val Asn Ser Glu Thr Asn Lys Leu Lys Thr Ala He Asp Ala 
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290 295 300 

Leu Asn Val Asp Lys Thr Glu Leu Asn Asn Thr lie Ala Asp Ala Lys 
305 310 315 320 

Thr Lys Val Lys Glu His Tyr Ser Asp Arg Ser Trp Gin Asn Leu Gin 
325 330 335 

Thr Glu Val Thr Lys Ala Glu Lys Val Ala Ala Asn Thr Asp Ala Lys 
340 345 350 

Gin Ser Glu Val Asn Glu Ala Val Glu Lys Leu Thr Ala Thr lie Glu 
355 360 365 

Lys Leu Val Glu Leu Ser Glu Lys Pro lie Leu Thr Leu Thr Ser Thr 
370 375 380 

Asp Lys Lys lie Leu Glu Arg Glu Ala Val Ala Lys Tyr Thr Leu Glu 
385 390 395 400 

Asn Gin Asn Lys Thr Lys lie Lys Ser lie Thr Ala Glu Leu Lys Lys 
405 410 415 

Gly Glu Glu Val lie Asn Thr Val Val Leu Thr Asp Asp Lys Val Thr 
420 425 430 

Thr Glu Thr lie Ser Ala Ala Phe Lys Asn Leu Glu Tyr Tyr Lys Glu 
435 440 445 

Tyr Thr Leu Ser Thr Thr Met lie Tyr Asp Arg Gly Asn Gly Glu Glu 
450 455 460 

Thr Glu Thr Leu Glu Asn Gin Asn lie Gin Leu Asp Leu Lys Lys Val 
465 470 475 480 

Glu Leu Lys Asn lie Lys Arg Thr Asp Leu lie Lys Tyr Glu Asn Gly 
485 490 495 

Lys Glu Thr Asn Glu Ser Leu lie Thr Thr lie Pro Asp Asp Lys Ser 
500 505 510 

Asn Tyr Tyr Leu Lys lie Thr Ser Asn Asn Gin Lys Thr Thr Leu Leu 
515 520 525 

Ala Val Lys Asn lie Glu Glu Thr Thr Val Asn Gly Thr Pro Val Tyr 
530 535 540 

Lys Val Thr Ala lie Ala Asp Asn Leu Val Ser Arg Thr Ala Asp Asn 
545 550 555 560 

Lys Phe Glu Glu Glu 
565 

INFORMATION FOR SEQ ID NO: 219: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1879 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 219: 
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AACACCTGTA TATAAAGTTA CAGCAATCGC AGACAATTTA GTCTCTAGAA CTGCTGATAA 60 

TAAATTTGAA GAAGAATACG TTCACTATAT TGAAAAACCT AAAGTCCACG AAGATAATGT 120 

ATATTATAAT TTCAAAGAAT TAGTGGAAGC TATTCAAAAC GATCCTTCAA AAGAATATCG 180 

TCTGGGACAA TCAATGAGCG CTAGAAATGT TGTTCCTAAT GGAAAATCAT ATATCACTAA 240 

AGAATTCACA GGAAAACTTT TAAGTTCTGA AGGAAAACAA TTTGCTATTA CTGAATTGGA 300 

ACATCCATTA TTTAATGTGA TAACAAACGC AACGATAAAT AATGTGAATT TTGAAAATGT 360 

AGAGATAGAA CGTTCTGGTC AAGATAATAT TGCATCATTA GCCAATACTA TGAAAGGTTC 420 

TTCAGTTATT ACAAATGTCA AAATTACAGG CACACTTTCA GGTCGTAATA ATGTTGCTGG 480 

ATTTGTAAAT AATATGAATG ATGGAACTCG TATTGAAAAT GTTGCTTTCT TTGGCAAACT 540 

ACACTCTACA AGTGGAAATG GCTCTCATAC AGGGGGAATT GCAGGTACAA ACTATAGAGG 600 

AATTGTTAGA AAAGCATATG TTGATGCTAC TATTACAGGA AACAAAACAC GCGCCAGCTT 660 

GTTAGTTCCT AAAGTAGATT ATGGATTAAC TCTAGACCAT CTTATTGGTA CAAAAGCTCT 720 

CCTAACTGAG TCGGTTGTAA AAGGTAAAAT AGATGTTTCA AATCCAGTAG AAGTTGGAGC 780 

AATAGCAAGT AAGACTTGGC CTGTAGGTAC GGTAAGTAAT TCTGTCAGCT ATGCTAAGAT 840 

TATCCGTGGA GAGGAGTTAT TCGGCTCTAA CGACGTTGAT GATTCTGATT ATGCTAGTGC 900 

TCATATAAAA GATTTATATG CGGTAGAGGG ATATTCGTCA GGTAATAGAT CATTTAGGAA 960 

ATCTAAAACA TTTACTAAAT TAACTAAAGA ACAAGCTGAT GCTAAAGTTA CTACTTTCAA 1020 

TATTACTGCT GATAAATTAG AAAGTGATCT ATCTCCTCTT GCAAAACTTA ATGAAGAAAA 1080 

AGCCTATTCT AGTATTCAAG ATTATAACGC TGAATATAAC CAAGCCTATA AAAATCTTGA 1140 

AAAATTAATA CCATTCTACA ATAAAGATTA TATTGTATAT CAAGGTAATA AATTAAATAA 1200 

AGAACACCAT CTAAATACTA AAGAAGTTCT TTCTGTTACC GCGATGAACA ACAATGAGTT 1260 

TATCACAAAC CTAGATGAAG CTAATAAAAT TATTGTTCAC TATGCGGACG GTACAAAAGA 1320 

TTACTTTAAC TTGTCTTCTA GCAGTGAAGG TTTAAGTAAT GTAAAAGAAT ATACTATAAC 1380 

TGACTTAGGA ATTAAATATA CACCTAATAT CGTTCAAAAA GATAACACTA CTCTTGTTAA 1440 

TGATATAAAA TCTATTTTAG AATCAGTAGA GCTTCAGTCT CAAACGATGT ATCAGCATCT 1500 

AAATCGATTA GGTGACTATA GAGTTAATGC AATCAAAGAT TTATATTTAG AAGAAAGCTT 1560 

CACAGATGTT AAAGAAAACT TAACAAACCT AATCACAAAA TTAGTTCAAA ACGAAGAACA 1620 

TCAACTAAAT GATTCTCCAG CTGCTCGTCA AATGATTCGT GATAAAGTCG AGAAAAACAA 1680 

AGCAGCTTTA TTACTAGGTT TAACTTACCT AAATCGTTAC TATGGAGTTA AATTTGGTGA 1740 

TGTTAATATT AAAGAATTAA TGCTATTCAA ACCAGATTTC TATGGTGAAA AAGTTAGCGT 1800 

ATTAGACAGA TTAATTGAAA TCGGTTGTAA AGAGAACAAC ATTAAAGGTT CACGTACATT 1860 

CGACGCATTC GGTCAAGTA 1879 
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INFORMATION FOR SEQ ID NO: 220: 

(i) SEQUENCE CHARACTERISTICS:- 

(A) LENGTH: 626 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220: 

Thr Pro Val Tyr Lys Val Thr Ala lie Ala Asp Asn Leu Val Ser Arg 
1 5 10 15 

Thr Ala Asp Asn Lys Phe Glu Glu Glu Tyr Val His Tyr lie Glu Lys 
20 25 30 

Pro Lys Val His Glu Asp Asn Val Tyr Tyr Asn Phe Lys Glu Leu Val 
35 40 45 

Glu Ala lie Gin Asn Asp Pro Ser Lys Glu Tyr Arg Leu Gly Gin Ser 
50 55 60 

Met Ser Ala Arg Asn Val Val Pro Asn Gly Lys Ser Tyr lie Thr Lys 
65 70 75 80 

Glu Phe Thr Gly Lys Leu Leu Ser Ser Glu Gly Lys Gin Phe Ala lie 
85 90 95 

Thr Glu Leu Glu His Pro Leu Phe Asn Val lie Thr Asn Ala Thr lie 
100 105 110 

Asn Asn Val Asn Phe Glu Asn Val Glu lie Glu Arg Ser Gly Gin Asp 
115 120 125 

Asn lie Ala Ser Leu Ala Asn Thr Met Lys Gly Ser Ser Val lie Thr 
130 135 140 

Asn Val Lys lie Thr Gly Thr Leu Ser Gly Arg Asn Asn Val Ala Gly 
145 150 155 160 

Phe Val Asn Asn Met Asn Asp Gly Thr Arg lie Glu Asn Val Ala Phe 
165 170 175 

Phe Gly Lys Leu His Ser Thr Ser Gly Asn Gly Ser His Thr Gly Gly 
180 185 190 

lie Ala Gly Thr Asn Tyr Arg Gly lie Val Arg Lys Ala Tyr Val Asp 
195 200 205 

Ala Thr lie Thr Gly Asn Lys Thr Arg Ala Ser Leu Leu Val Pro Lys 
210 215 220 

Val Asp Tyr Gly Leu Thr Leu Asp His Leu lie Gly Thr Lys Ala Leu 
225 230 235 240 

Leu Thr Glu Ser Val Val Lys Gly Lys lie Asp Val Ser Asn Pro Val 
245 250 255 

Glu Val Gly Ala He Ala Ser Lys Thr Trp Pro Val Gly Thr Val Ser 
260 265 270 
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Asn Ser Val Ser Tyr Ala Lys lie lie Arg Gly Glu Glu Leu Phe Gly 
275 280 285 

Ser Asn Asp Val Asp Asp Ser Asp Tyr Ala Ser Ala His lie Lys Asp 
290 295 300 

Leu Tyr Ala Val Glu Gly Tyr Ser Ser Gly Asn Arg Ser Phe Arg Lys 
305 310 315 320 

Ser Lys Thr Phe Thr Lys Leu Thr Lys Glu Gin Ala Asp Ala Lys Val 
325 330 335 

Thr Thr Phe Asn lie Thr Ala Asp Lys Leu Glu Ser Asp Leu Ser Pro 
340 345 350 

Leu Ala Lys Leu Asn Glu Glu Lys Ala Tyr Ser Ser lie Gin Asp Tyr 
355 360 365 

Asn Ala Glu Tyr Asn Gin Ala Tyr Lys Asn Leu Glu Lys Leu lie Pro 
370 375 380 

Phe Tyr Asn Lys Asp Tyr lie Val Tyr Gin Gly Asn Lys Leu Asn Lys 
385 390 395 400 

Glu His His Leu Asn Thr Lys Glu Val Leu Ser Val Thr Ala Met Asn 
405 410 415 

Asn Asn Glu Phe lie Thr Asn Leu Asp Glu Ala Asn Lys lie lie Val 
420 425 430 

His Tyr Ala Asp Gly Thr Lys Asp Tyr Phe Asn Leu Ser Ser Ser Ser 
435 440 445 

Glu Gly Leu Ser Asn Val Lys Glu Tyr Thr lie Thr Asp Leu Gly lie 
450 455 460 

Lys Tyr Thr Pro Asn lie Val Gin Lys Asp Asn Thr Thr Leu Val Asn 
465 470 475 480 

Asp lie Lys Ser lie Leu Glu Ser Val Glu Leu Gin Ser Gin Thr Met 
485 490 495 

Tyr Gin His Leu Asn Arg Leu Gly Asp Tyr Arg Val Asn Ala lie Lys 
500 505 510 

Asp Leu Tyr Leu Glu Glu Ser Phe Thr Asp Val Lys Glu Asn Leu Thr 
515 520 525 

Asn Leu lie Thr Lys Leu Val Gin Asn Glu Glu His Gin Leu Asn Asp 
530 535 540 

Ser Pro Ala Ala Arg Gin Met lie Arg Asp Lys Val Glu Lys Asn Lys 
545 550 555 560 

Ala Ala Leu Leu Leu Gly Leu Thr Tyr Leu Asn Arg Tyr Tyr Gly Val 
565 570 575 

Lys Phe Gly Asp Val Asn lie Lys Glu Leu Met Leu Phe Lys Pro Asp 
580 585 590 

Phe Tyr Gly Glu Lys Val Ser Val Leu Asp Arg Leu lie Glu lie Gly 
595 600 605 
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Ser Lys Glu Asn Asn lie Lys Gly Ser Arg Thr Phe Asp Ala Phe Gly 
610 615 620 

Gin Val 
625 

(2) INFORMATION FOR SEQ ID NO: 221: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1777 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221: 



TV mm 7\ O 7\ O 7\ O 7\ 

ATT ACj AL AKjA 


i 1 AA i i (j AAA 


mOi^ommomA A 

i LGG i i L i AA 


A/^A/^'AAOAA^ 

ACj Atj AAL. AAL- 


Al IAAALjoI i 


UAL.LjiAL.Al i 


D u 


CGACGCATTC 


f~^r~*rr^c> TV TV /^m TV m 
GG i L AAG i A i 


i (jVjL. 1 AAA J. A 


T* A r"T>A A A rnr* A 

I AL. i AAA 1 L. A 


OOrpTv A'T"T"T'Ar' 


AiLrL-Ai i i i i 




TV TV TV mm TV m 7\ tv m 

AAATTATAAT 


TV TV o 7\ TV rTim/~»rn 

AGACAATTGT 


mo AOA A AmAT> 

iGAGAAAiAi 


A O A /"* A A rn A rno 
ACjAL, AA i A 1 Lj 


AAL-UAi iVjLii 


mm A mrpO A ^T^C^f^ 

i iAi iLjAiLjL- 


1 fin 


m"»V TV TV TV TV /"t 

TACAGAAGAC 


/-*-< T» mo mo m tv o tv 

CATGTCTACA 


mo OO TV O TV TV O O 

TL GG AGAAG G 


o oommo mo a o 


/^mO/*^ A A <^ A A A 

Vj i L-OAALjAAA 


rpnna a a a aTinnr^ 
i iAAAAAi iU 


o /I n 
Z 4 u 


m 7\ TV TV TV mo 

TAAAC ATC GT 


/~»^ TV mmoo TV m A 

GL ATT L GAT A 


A mmrn A a a a O/"* 
All lAAAAHa 


A A omo A oomm 
AAVjiUAL-Ul i 


A A A A T* A /^T* A 

AVj AAA i AL. i A 


i AL- i UL-L- AL- i 


7 n n 
J u u 


TV /"t my"* TV TV m TV mm 

AC TG AAT ATT 


/"l TV mTV TV TV TV 

GATAAAGCAC 


ATC TTT ATTT 


A A mmTi/"' A A A rn 

AAi 1 iCAAAi 


i A i AA i LjV^ AA 


mrn o o o rnrp rpo o 
i i LjL-L- i i i LxLr 


J D u 


rpTv /~»moo TV o 7\ 

1 AG 1 GL AGACj 


o o TV mm TV oom A 


A A A A A m/"* A T^T' 

AAAAA i A i i 


Ar'A AOAT^A TTT* 

ALjAAVjAIAI 1 


AAAvjA i A i 


T'T'a ar*a a apr' 

i iAAV^AAAvaL^ 


z u 


TGCAGATGGT 


mTV mTV o TV TV TV /^m 

T AT AG AAAC T 


TV mm TV m/*^ tv mrrirn 

ATTA i GA i 1 1 


/-imoom A moom 
G IGGi AiLG i 


L i ALiL.G X L- i Li 


A rp A A /^(^TlTlA A 

AiAALLji iAA 


/ion 


TV O TV TV /"»/"< TV O m TV 

AC AACGAC TA 


GTAAGAGA i G 


L. iGi i Ai ICG 


Ti A TITITI/"'/^/^ A A 


oomm A m A a oo 
LrLil lAiAAL-Li 


o rpo o rpo o rno 

\^ i i LjLj i LrLj 




TV rn^/"'/"'rnrn/^ tv tv 

ATGGG 1 1 GAA 


AAAi AiGGLG 


CjU 1 a i AA J. AL. 


r*r^ A O A A A r"T» A 

CVjAL. AAALi 1 A 


1 Ai AL- iUL. XL. 


T'T' a p a p a a 'pt' 

i iALrALiAAi i 


n n 

D U u 


CTTTGGTCCT 


ATGGATAAGT 


ATTATAATTA 


TAATGGAACA 


GGAGCTTATG 


CTGCTATATA 


660 


TCCTAACTCT 


GATGATATTA 


GAACTGATGT 


AAAATATGTT 




TGGTTGGTGA 


720 


ATACGGTATT 


TCAGTTTACA 


CACATGAAAC 


AACACACGTC 


AACGACCGTG 


CGATTTACTT 


780 


AGGTGGCTTT 


GGACACCGTG 


AAGGTACTGA 


TGCTGAAGCA 


TATGCTCAGG 


GTATGCTACA 


840 


AACTCCTGTT 


ACTGGTAGTG 


GATTTGATGA 


GTTTGGTTCT 


TTAGGTATTA 


ATATGGTATT 


900 


TAAACGCAAA 


AATGATGGGA 


ATCAGTGGTA 


TATTACAGAT 


CCAAAAACTC 




960 


AGAAGATATT 


AATAGATATA 


TGAAGGGTTA 


TAATGACACT 


TTAACTCTTC 


TTGATGAAAT 


1020 


TGAGGCTGAA 


TCTGTGATTT 


CTCAACAAAA 


TAAAGATTTA 


AATAGTGCAT 


GGTTCAAAAA 


1080 


AATAGATAGA 


GAATACCGTG 


ATAACAATAA 


ATTAAATCAA 


TGGGATAAAA 


TTCGAAATCT 


1140 


AAGTCAAGAA 


GAGAAAAATG 


AATTAAATAT 


TCAATCTGTT 


AATGATTTAG 


TTGATCAACA 


1200 


ATTAATGACT 


AATCGCAATC 


CAGGTAATGG 


TATCTATAAA 


CCCGAAGCAA 


TTAGCTATAA 


1260 


CGATCAATCA 


CCTTATGTAG 


GTGTTAGAAT 


GATGACCGGT 


ATCTACGGAG 


GTAATACTAG 


1320 


TAAAGGTGCT 


CCTGGAGCTG 


TTTCATTCAA 


ACATAATGCT 


TTTAGATTAT 


GGGGTTACTA 


1380 



216 



CGGATACGAA AATGGGTTCT TAGGTTATGC TTCAAATAAA TATAAACAAC AATCTAAAAC 1440 

AGATGGTGAG TCTGTTGTAA GTGATGAATA TATTATCAAG AAAATATCTA ACAATACATT 1500 

TAATACTATT GAAGAATTTA AAAAAGCTTA CTTCAAAGAA GTTAAAGATA AAGCAACGAA 1560 

AGGATTAACA ACATTCGAAG TAAATGGTTC TTCCGTTTCA TCATACGATG ATTTACTGAC 1620 

ATTGTTTAAA GAAGCTGTTA AAAAAGATGC CGAAACTCTT AAACAAGAAG CAAACGGTAA 1680 

TAAAACAGTA TCTATGAATA ATACAGTTAA ATTAAAAGAA GCTGTTTATA AGAAACTTCT 1740 

TCAACAAACA AATAGCTTTA AAACTTCAAT CTTTAAA 1777 
(2) INFORMATION FOR SEQ ID NO: 222: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 592 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222: 

Leu Asp Arg Leu lie Glu lie Gly Ser Lys Glu Asn Asn lie Lys Gly 
1 5 10 15 ^ 

Ser Arg Thr Phe Asp Ala Phe Gly Gin Val Leu Ala Lys Tyr Thr Lys 
20 25 30 

Ser Gly Asn Leu Asp Ala Phe Leu Asn Tyr Asn Arg Gin Leu Phe Thr 
35 40 45 

Asn lie Asp Asn Met Asn Asp Trp Phe lie Asp Ala Thr Glu Asp His 
50 55 60 

Val Tyr lie Ala Glu Arg Ala Ser Glu Val Glu Glu lie Lys Asn Ser 
65 70 75 80 

Lys His Arg Ala Phe Asp Asn Leu Lys Arg Ser His Leu Arg Asn Thr 
85 90 95 

lie Leu Pro Leu Leu Asn lie Asp Lys Ala His Leu Tyr Leu lie Ser 
100 105 110 

Asn Tyr Asn Ala lie Ala Phe Gly Ser Ala Glu Arg Leu Gly Lys Lys 
115 120 125 

Ser Leu Glu Asp lie Lys Asp lie Val Asn Lys Ala Ala Asp Gly Tyr 
130 135 140 

Arg Asn Tyr Tyr Asp Phe Trp Tyr Arg Leu Ala Ser Asp Asn Val Lys 
145 150 155 160 

Gin Arg Leu Leu Arg Asp Ala Val lie Pro lie Trp Glu Gly Tyr Asn 
165 170 175 

Ala Pro Gly Gly Trp Val Glu Lys Tyr Gly Arg Tyr Asn Thr Asp Lys 
180 185 190 

Val Tyr Thr Pro Leu Arg Glu Phe Phe Gly Pro Met Asp Lys Tyr Tyr 
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195 200 205 

Asn Tyr Asn Gly Thr Gly Ala Tyr Ala Ala lie Tyr Pro Asn Ser Asp 
210 215 220 

Asp He Arg Thr Asp Val Lys Tyr Val His Leu Glu Met Val Gly Glu 
225 230 235 240 

Tyr Gly He Ser Val Tyr Thr His Glu Thr Thr His Val Asn Asp Arg 
245 250 255 

Ala He Tyr Leu Gly Gly Phe Gly His Arg Glu Gly Thr Asp Ala Glu 
260. 265 270 

Ala Tyr Ala Gin Gly Met Leu Gin Thr Pro Val Thr Gly Ser Gly Phe 
275 280 285 

Asp Glu Phe Gly Ser Leu Gly He Asn Met Val Phe Lys Arg Lys Asn 
290 295 300 

Asp Gly Asn Gin Trp Tyr He Thr Asp Pro Lys Thr Leu Lys Thr Arg 
305 310 315 320 

Glu Asp He Asn Arg Tyr Met Lys Gly Tyr Asn Asp Thr Leu Thr Leu 
325 330 335 

Leu Asp Glu He Glu Ala Glu Ser Val He Ser Gin Gin Asn Lys Asp 
340 345 350 

Leu Asn Ser Ala Trp Phe Lys Lys He Asp Arg Glu Tyr Arg Asp Asn 
355 360 365 

Asn Lys Leu Asn Gin Trp Asp Lys He Arg Asn Leu Ser Gin Glu Glu 
370 375 380 

Lys Asn Glu Leu Asn He Gin Ser Val Asn Asp Leu Val Asp Gin Gin 
385 390 395 400 

Leu Met Thr Asn Arg Asn Pro Gly Asn Gly He Tyr Lys Pro Glu Ala 
405 410 415 

He Ser Tyr Asn Asp Gin Ser Pro Tyr Val Gly Val Arg Met Met Thr 
420 425 430 

Gly He Tyr Gly Gly Asn Thr Ser Lys Gly Ala Pro Gly Ala Val Ser 
435 440 445 

Phe Lys His Asn Ala Phe Arg Leu Trp Gly Tyr Tyr Gly Tyr Glu Asn 
450 455 460 

Gly Phe Leu Gly Tyr Ala Ser Asn Lys Tyr Lys Gin Gin Ser Lys Thr 
465 470 475 480 

Asp Gly Glu Ser Val Leu Ser Asp Glu Tyr He He Lys Lys He Ser 
485 490 495 

Asn Asn Thr Phe Asn Thr He Glu Glu Phe Lys Lys Ala Tyr Phe Lys 
500 505 510 

Glu Val Lys Asp Lys Ala Thr Lys Gly Leu Thr Thr Phe Glu Val Asn 
515 520 525 

Gly Ser Ser Val Ser Ser Tyr Asp Asp Leu Leu Thr Leu Phe Lys Glu 
530 535 540 
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Ala Val Lys Lys Asp Ala Glu Thr Leu Lys Gin Glu Ala Asn Gly Asn 
545 550 555 560 

Lys Thr Val Ser Met Asn Asn Thr Val Lys Leu Lys Glu Ala Val Tyr 
565 570 575 



Lys Lys Leu Leu Gin Gin Thr Asn Ser Phe Lys Thr Ser lie Phe Lys 
580 585 590 

(2) INFORMATION FOR SEQ ID NO: 223: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 460 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 223: 

TAAGACAGAT GAACGGAGCA AGGTGTTTGA CTTTTCCATT CCCTACTATA CTGCAAAAAA 60 

TAAACTCATT GTCAAAAAAT CTGACTTGAC TACTTATCAG TCTGTAAACG ACTTGGCGCA 120 

GAAAAAGGTT GGAGCGCAGA AAGGTTCGAT TCAAGAGACG ATGGCGAAAG ATTTGCTACA 180 

AAATTCTTCC CTCGTATCTC TGCCTAAAAA TGGGAATTTA ATCACAGATT TAAAATCAGG 240 

ACAAGTGGAT GCCGTTATCT TTGAAGAACC TGTTTCCAAG GGATTTGTGG AAAATAATCC 300 

TGATTTAGCA ATCGCAGACC TCAATTTTGA AAAAGAGCAA GATGATTCCT ACGCGGTAGC 360 

CATgAAAAAA GATAGCAAGA AATTGAAGAG GCAGTTCGAT AAAACCATTC AAAAGTTGAA 420 

GGAGTCTGGG GAATTAGACA AACTCATTGA GGAAGCCTTA 460 
(2) INFORMATION FOR SEQ ID NO: 224: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 153 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224: 

Lys Thr Asp Glu Arg Ser Lys Val Phe Asp Phe Ser lie Pro Tyr Tyr 
15 10 15 

Thr Ala Lys Asn Lys Leu lie Val Lys Lys Ser Asp Leu Thr Thr Tyr 
20 25 30 

Gin Ser Val Asn Asp Leu Ala Gin Lys Lys Val Gly Ala Gin Lys Gly 
35 40 45 

Ser lie Gin Glu Thr Met Ala Lys Asp Leu Leu Gin Asn Ser Ser Leu 
50 55 60 

Val Ser Leu Pro Lys Asn Gly Asn Leu lie Thr Asp Leu Lys Ser Gly 
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. .■■■■■1 

65 70 75 80 J 

Gin Val Asp Ala Val lie Phe Glu Glu Pro Val Ser Lys Gly Phe Val 1 
85 90 95 ^ 

Glu Asn Asn Pro Asp Leu Ala lie Ala Asp Leu Asn Phe Glu Lys Glu i 
100 105 110 I 

Gin Asp Asp Ser Tyr Ala Val Ala Met Lys Lys Asp Ser Lys Lys Leu 
115 120 125 

Lys Arg Gin Phe Asp Lys Thr lie Gin Lys Leu Lys Glu Ser Gly Glu 
130 135 140 

Leu Asp Lys Leu lie Glu Glu Ala Leu 
145 150 

(2) INFORMATION FOR SEQ ID NO: 225: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 751 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225: 



CTGTGAGAAT 


CAAGCTACAC 


CCAAAGAGAC 


TAGCGCTCAA 


AAGACAATCG 


TCCTTGCTAC 


60 


AGCTGGCGAC 


GTGCCACCAT 


TTGACTACGA 


AGACAAGGGC 


AATCTGACAG 


GCTTTGATAT 


120 


CGAAGTTTTA 


AAGGCAGTAG 


ATGAAAAACT 


CAGCGACTAC 


GAGATTCAAT 


TCCAAAGAAC 


180 


CGCCTGGGAG 


AGCATCTTCC 


CAGGACTTGA 


TTCTGGTCAC 


TATCAGGCTG 


CGGCCAATAA 


240 


CTTGAGTTAC 


ACAAAAGAGC 


GTGCTGAAAA 


ATACCTTTAC 


TCGCTTCCAA 


TTTCCAACAA 


300 


TCCCCTCGTC 


CTTGTCAGCA 


ACAAGAAAAA 


TCCTTTGACT 


TCTCTTGACC 


AGATCGCTGG 


360 


TAAAACAACA 


CAAGAGGATA 


CCGGAACTTC 


TAACGCTCAA 


TTCATCAATA 


ACTGGAATCA 


420 


GAAACACACT 


GATAATCCCG 


CTACAATTAA 


TTTTTCTGGT 


GAGGATATTG 


GTAAACGAAT 


480 


CCTAGACCTT 


GCTAACGGAG 


AGTTTGATTT 


CCTAGTTTTT 


GACAAGGTAT 


CCGTTCAAAA 


540 


GATTATCAAG 


GACCGTGGTT 


TAGACCTCTC 


AGTCGTTGAT 


TTACCTTCTG 


CAGATAGCCC 


600 


CAGCAATTAT 


ATCATTTTCT 


CAAGCGACCA 


AAAAGAGTTT 


AAAGAGCAAT 


TTGATAAAGC 


660 


GCTCAAAGAA 


CTCTATCAAG 


ACGGAACCCT 


TGAAAAACTC 


AGCAATACCT 


ATCTAGGTGG 


720 


TTCTTACCTC 


CCAGATCAAT 


CTCAGTTACA 


A 






751 



(2) INFORMATION FOR SEQ ID NO: 226: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 250 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:226: 

Cys Glu Asn Gin Ala Thr Pro Lys Glu Thr Ser Ala Gin Lys Thr He 
1 5 10 15 

Val Leu Ala Thr Ala Gly Asp Val Pro Pro Phe Asp Tyr Glu Asp Lys 
20 25 30 

Gly Asn Leu Thr Gly Phe Asp He Glu Val Leu Lys Ala Val Asp Glu 
35 40 45 

Lys Leu Ser Asp Tyr Glu He Gin Phe Gin Arg Thr Ala Trp Glu Ser 
50 55 60 

He Phe Pro Gly Leu Asp Ser Gly His Tyr Gin Ala Ala Ala Asn Asn 
65 70 75 80 

Leu Ser Tyr Thr Lys Glu Arg Ala Glu Lys Tyr Leu Tyr Ser Leu Pro 
85 90 95 

He Ser Asn Asn Pro Leu Val Leu Val Ser Asn Lys Lys Asn Pro Leu 
100 105 110 

Thr Ser Leu Asp Gin He Ala Gly Lys Thr Thr Gin Glu Asp Thr Gly 
115 120 125 

Thr Ser Asn Ala Gin Phe He Asn Asn Trp Asn Gin Lys His Thr Asp 
130 135 140 

Asn Pro Ala Thr He Asn Phe Ser Gly Glu Asp He Gly Lys Arg He 
145 150 155 160 

Leu Asp Leu Ala Asn Gly Glu Phe Asp Phe Leu Val Phe Asp Lys Val 
165 170 175 

Ser Val Gin Lys He He Lys Asp Arg Gly Leu Asp Leu Ser Val Val 
180 185 190 

Asp Leu Pro Ser Ala Asp Ser Pro Ser Asn Tyr He He Phe Ser Ser 
195 200 205 

Asp Gin Lys Glu Phe Lys Glu Gin Phe Asp Lys Ala Leu Lys Glu Leu 
210 215 220 

Tyr Gin Asp Gly Thr Leu Glu Lys Leu Ser Asn Thr Tyr Leu Gly Gly 
225 230 235 240 

Ser Tyr Leu Pro Asp Gin Ser Gin Leu Gin 
245 250 

INFORMATION FOR SEQ ID NO: 227: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 227: 
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GACTGGATCC TAAAATCTAC GACAATAAAA ATC 33 
(2) INFORMATION FOR SEQ ID NO: 228: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 228: 
CTGAGTCGAC TGGTTGTGCT GGTTGAG 27 
(2) INFORMATION FOR SEQ ID NO: 229: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229: 
GTCAGGATCC AAATTACAAT ACGGACTATG 3 0 

(2) INFORMATION FOR SEQ ID NO: 230: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230: 
CAGTGTCGAC TAACTCTAGG TCGGAAAC 28 
(2) INFORMATION FOR SEQ ID NO: 231: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231: 
GACTGGATCC TGAGAATCAA GCTACACCCA AAGAG 35 
(2) INFORMATION FOR SEQ ID NO: 232: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 



222 



(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232: 
AGTCAAGCTT TTGTAACTGA GATTGATCTG G 
(2) INFORMATION FOR SEQ ID NO: 233: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 233: 
GACTGGATCC TGGTAACCGC TCTTCTCGTA ACGCAGC 
(2) INFORMATION FOR SEQ ID NO: 234: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 234: 
AGTCAAGCTT TTTCAGGAAC TTTTACGCTT CC 
(2) INFORMATION FOR SEQ ID NO: 235: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235: 
AGTCAGATCT TGTGGAAATT TGACAGGTAA CAGCAAAAAA GCTGC 
(2) INFORMATION FOR SEQ ID NO: 236: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 236: 
ACTGAAGCTT TTTTGTTTTT CAAGAATTCA TCG 



223 



(2) INFORMATION FO|t SpQr' ift: ^0 : ^37: 

(i) SEQUENCE CHARACTERISTIGS : 

(A) LENGTH: 35 base pairs 

(B) TYPEi nucleic acid 
■(C) STRANDEDNESS : double 

( D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
GACTGGATCC TGGTCAAGGA ACTGCTTCTA AAGAC 
(2) INFORMATION FOR^^SEQ ID NO: 238: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
AGTCAAGCTT TCACAAATTC GTTGGTGAAG CC 
(2) INFORMATION FOR SEQ ID NO: 239: 




(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
GACTGGATCC TAGCTCAGGT GGAAACGCTG GTTCATCC 
(2) INFORMATION FOR SEQ ID NO: 240: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
AGTCAAGCTT ATCAACTTTT CCACCTTCAA CAACC 
(2) INFORMATION FOR SEQ ID NO : 241: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241 
GTCAAGATCT CTCCAACTAT GGTAAATCTG CGGATGG 
(2) INFORMATION FOR SEQ ID NO: 242: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242 
AGTCCTGCAG ATCCACATCC GCTTTCATCG GGTTAAAGAA GG 
(2) INFORMATION FOR SEQ ID NO: 243: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243 
GACTGGATCC TGGGAAAAAT TCTAGCGAAA CTAGTGG 
(2) INFORMATION FOR SEQ ID NO: 244: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244 
GTCACTGCAG CTGTCCTTCT TTTACTTCTT TGGTTGC 
(2) INFORMATION FOR SEQ ID NO: 245: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 245 
GACTGGATCC TGCTAGCGGA AAAAAAGATA CAACTTCTGG 
(2) INFORMATION FOR SEQ ID NO: 246: 



225 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : double 

(D) TOPOLOGY:" linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 246 
CTGAAAGCTT TTTTGCCAAT CCTTCAGCAA TCTTGTC 
(2) INFORMATION FOR SEQ ID NO: 247: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247 
GACTAGATCT TGGCTCAAAA AATACAGCTT CAAGTCC 
(2) INFORMATION FOR SEQ ID NO: 248: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248 
AGTCCTGCAG GTTTTTGTTT GCTTGGTATT GGTCG 
(2) INFORMATION FOR SEQ ID NO: 249: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249 
GACTGGATCC TAGTACAAAC TCAAGCACTA GTCAGACAGA G 
(2) INFORMATION FOR SEQ ID NO : 250: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250 
CAGTCTGCAG TTTCAAAGCT TTTTGTATGT CTTC 
(2) INFORMATION FOR SEQ ID NO : 251: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 251 
GACTGGATCC TGGCAATTCT GGCGGAAGTA AAGATGC 
(2) INFORMATION FOR SEQ ID NO : 252: 

(i). SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 252 
AGTCAAGCTT GTTTCATAGC TTTTTTGATT GTTTCG 
(2) INFORMATION FOR SEQ ID NO: 253: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 253 
GACTGGATCC TTCACAAGAA AAAACAAAAA ATGAAGATGG 
(2) INFORMATION FOR SEQ ID NO: 254: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 254 

AGTCAAGCTT ATCGACGTAG TCTCCGCCTT C 

(2) INFORMATION FOR SEQ ID NO: 255: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 38 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255: 
GACTGGATCC GAAAGGTCTG TGGTCAAATA ATCTTACC 38 
(2) INFORMATION FOR SEQ ID NO: 256: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 256: 
AGTCAAGCTT AGAGTTAACA TGGTGCTTGC CAATAGG 37 
(2) INFORMATION FOR SEQ ID NO: 257: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 257: 
GACTGGATCC AAACTCAGAA AAGAAAGCAG ACAATGC 37 
(2) INFORMATION FOR SEQ ID NO: 258: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 258: 
AGTCAAGCTT CCAAACTGGT TGATCCAAAC CATCTG 36 
(2) INFORMATION FOR SEQ ID NO: 259: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 259: 
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GAGTGGATCC TTCGAAAGGG TGAGAAGGTG GAGACC 36 ^^^'^^^^^^^^ 

(2) INFORMATION FOR SEQ ID NO: 260: ,V;^ 

(i) SEQUENCE CHARACTERISTICS: 4 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid ' ' 

(C) STRANDEDNESS : double f 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 260: 
AGTCAAGCTT CTGTAGGCTT GGTGTGCCCC AGTTGC 3 6 

(2) INFORMATION FOR SEQ ID NO: 261: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261: 
CTGAGGATCC GGGGATGGCA GCTTTTAAAA ATC 33 
(2) INFORMATION FOR SEQ ID NO: 262: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 262: 
CAGTAAGCTT GTTTACCCAT TCACCATTAC C 31 
(2) INFORMATION FOR SEQ ID NO: 263: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 263: 
CAGTGGATCC AGACGAGCAA AAAATTAAG 29 
(2) INFORMATION FOR SEQ ID NO: 264: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 264: 
TCAGAAGCTT GTTTACCCAT TCACCATT 28 
(2) INFORMATION FOR SEQ ID NO: 2 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

( C ) STRAND EDNESS : double 

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 265: 
GACTGGATCC CTGTGGTGAG GAAGAAACTA AAAAG 35 
(2) INFORMATION FOR SEQ ID NO: 266: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 266: 
CTGAGTCGAC AATATTCTGT AGGAATGCTT CGAATTTG 38 
(2) INFORMATION FOR SEQ ID NO: 267: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267: 
CTGAGGATCC GACTTTTAAC AATAAAACTA TTGAAGAG 38 
(2) INFORMATION FOR SEQ ID NO: 268: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 268: 
GTCACTGCAG GTTGTCACCT CCAAAAATCA CGG 
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(2) INFORMATION FOR SEQ ID NO: 269: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 69: 
GACTGGATCC CTTTACAGGT AAACAACTAC AAGTCGG 37 
(2) INFORMATION FOR SEQ ID NO : 270: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 270: 
CAGTAAGCTT TTCGAAGTTT GGCTCAGAAT TG 32 
(2) INFORMATION FOR SEQ ID NO: 271: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 271: 
GACTGGATCC CCAGGCTGAT ACAAGTATCG CA 32 
(2) INFORMATION FOR SEQ ID NO : 272: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 272: 
CAGTAAGCTT ATCTGCAGTA TGGCTAGATG G 31 
(2) INFORMATION FOR SEQ ID NO: 273: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 273 
GACTGGATCC GTCTGTATCA TTTGAAAACA AAGAAAC 
(2) INFORMATION FOR SEQ ID NO : 274: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 274 
CAGTCTGCAG TTTTACTGTT GCTGTGCTTG TG 
(2) INFORMATION FOR SEQ ID NO: 275: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 275 
ACTGAGATCT TGGTCAAAAG GAAAGTCAGA CAGGAAAGG 
(2) INFORMATION FOR SEQ ID NO: 276: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 276 
CAGTAAGCTT ATTCCTGAGC TTTTTTGATA AAGGTTGCGC A 
(2) INFORMATION FOR SEQ ID NO: 277: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277 
ACTGGGATCC GAAGGATAGA TATATTTTAG CATTTGAGAC 
(2) INFORMATION FOR SEQ ID NO: 278: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs \ 

(B) TYPE: nucleic acid J 

(C) STRANDEDNESS : double /4 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278: 
AGTCAAGCTT CCATGGTATC AAAGGCAAGA CTTGG 35 
(2) INFORMATION FOR SEQ ID NO: 279: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 279: 
GTCAGGATCC GGTAGTTAAA GTTGGTATTA ACGG 34 
(2) INFORMATION FOR SEQ ID NO: 280: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 280: 
AGTCAAGCTT GCAATTTTTG CGAAGTATTC CAAGAG 36 
(2) INFORMATION FOR SEQ ID NO: 281: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 281: 
AGTCGGATCC TTCTTACGAG TTGGGACTGT ATCAAGC 37 
(2) INFORMATION FOR SEQ ID NO: 282: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 282 
AGTCAAGCTT GTTTATTTTT TCCTTACTTA CAGATGAAGG 
(2) INFORMATION 'FOR SEQ ID NO: 283: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 283 
AGTCGGATCC TACTGAGATG CATCATAATC TAGGAGC 
(2) INFORMATION FOR SEQ ID NO: 284: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 284 
TCAGCTCGAG TTCTTTGACA TCTCCATCAT AAGTCGC 
(2) INFORMATION FOR SEQ ID NO: 285: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 285 
GACTGGATCC GGTTTTGAGA AAGTATTTGC AGGGG 
(2) INFORMATION FOR SEQ ID NO: 286: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 286 

CAGTAAGCTT GGATTTTTTC ATGGATGCAA TTTTTTTGG 

(2) INFORMATION FOR SEQ ID NO: 287: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 43 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDMESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 287: 
GACTGGATCC GACAACATTT ACTATCCATA CAGTAGAGTC AGC 
(2) INFORMATION FOR SEQ ID NO: 288: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 288: 
GACTAAGCTT GGCATAAGGT TGCAATTCTG GATTAATTGG 
(2) INFORMATION FOR SEQ ID NO : 289: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 289: 
GACTGGATCC GGCTAAGGAA AGAGTGGATG 
(2) INFORMATION FOR SEQ ID NO: 290: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290: 
GACTAAGCTT TTCATTTTTA AATTGACTAT GCGCCCG 
(2) INFORMATION FOR SEQ ID NO: 291: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 291: 



235 



■i r 

GACTGGATCC TTGTTGCTAT GAACTTGGTC GTCACC 3 6 'p 

(2) INFORMATION FOR SEQ ID NO: 292: '% 

(i) SEQUENCE CHAIUVCTERISTICS : 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 292: 
CATGAAGCTT ATCCTGGATT TTTCCAAGTA AATCT 35 
(2) INFORMATION FOR SEQ ID NO: 293: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 293: 
GACTGGATCC TTATAAGGGT GAATTAGAAA AAGG 34 
(2) INFORMATION FOR SEQ ID NO: 294: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 294: 
GACTAAGCTT CTTATTAGGA TTGTTAGTAG TTG 33 
(2) INFORMATION FOR SEQ ID NO: 295: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 295: 
GACTGGATCC GAATGTTCAG GCTCAAGAAA GTTCAGG 37 
(2) INFORMATION FOR SEQ ID NO: 296: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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( D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 296 
GACTAAGCTT TTCCCCTGAT GGAGCAAAGT AATACC 
(2) INFORMATION FOR SEQ ID NO: 297: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
{ D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 297 
GACTGGATCC CTTGGGTGTA ACCCATATCC AGCTCCTTCC 
(2) INFORMATION FOR SEQ ID NO : 298: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 298 
GACTGTCGAC TTCAGCTTGT TTATCTGGGG TTGC 
(2) INFORMATION FOR SEQ ID NO: 299: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 299 
GACTGGATCC TAGTGATGGT ACTTGGCAAG GAAAACAG 
(2) INFORMATION FOR SEQ ID NO : 300: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300 
ACTGCTGCAG ATCTTTGCCA CCTAGCTTCT CATTG 
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(2) INFORMATION ^ FOR SEQ ID NO : 301: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 301 
GTCAGGATCC TGGGATTCAA TATGTCAGAG ATGATACTAG 
(2) INFORMATION FOR SEQ ID NO: 302: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID, NO: 302 
CTAGAAGCTT ACGCACCCAT TCACCATTAT CATTG 
(2) INFORMATION FOR SEQ ID NO: 303: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 303 
GTCAGGATCC GGATAATAGA GAAGCATTAA AAACC 
(2) INFORMATION FOR SEQ ID NO: 304: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 304 
AGTCAAGCTT GACAAAATCT TGAAACTCCT CTGGTC 
(2) INFORMATION FOR SEQ ID NO: 305: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 305: 
GTCAGGATCC AGATTTTGTC GAGGAGTGTC ATACC 
(2) INFORMATION FOR SEQ ID NO: 306: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 306: 
AGTCAAGCTT TCCCTTTTTA CCCTTACGAA TCCAGG 
(2) INFORMATION FOR SEQ ID NO: 307: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 307: 
GACTGGATCC ATCTGTAGTT TATGCGGATG AAACACTTAT TAG 
(2) INFORMATION FOR SEQ ID NO : 308: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 308: 
GACTGTCGAC GCTTTGGTAG AGATAGAAGT CATG 
(2) INFORMATION FOR SEQ ID NO: 309: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 309: 
GACTGGATCC TTACTTTGGT ATCGTAGATA CAGCCGGC 
(2) INFORMATION FOR SEQ ID NO: 310: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 310: 
AGTCAAGCTT TGTTAATTGC GTACCTTCTA AGCGACC 37 
(2) INFORMATION FOR SEQ ID NO: 311: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 311: 
GACTGGATCC AGCTAAGGTT GCATGGGATG CGATTCG 37 
(2) INFORMATION FOR SEQ ID NO: 312: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312: 
GACTGTCGAC CTGGGCTTTA TTAGTTTGAC TAGC 34 



(2) INFORMATION FOR SEQ ID NO: 313: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 313: 
CAGTGGATCC CTATCACTAT GTAAATAAAG AGA 33 
(2) INFORMATION FOR SEQ ID NO: 314: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 314: 
ACTGAAGCTT TTCTGTCCCT GTTTGAGGCA 3 0 

(2) INFORMATION FOR SEQ ID NO: 315: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 315: 
CAGTGGATCC TGAGACTCCT CAATCAATAA CAAA 34 
(2) INFORMATION FOR SEQ ID NO: 316: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 316: 
ACGTAAGCTT ATAATCAGTA GGAGAAACTG AACT 34 
(2) INFORMATION FOR SEQ ID NO: 317: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 317: 
CAGTGGATCC GGATGCTCAA GAAACTGCGG 30 
(2) INFORMATION FOR SEQ ID NO : 318: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 318: 

GACTAAGCTT TTGCCTCTCA TTCTTGCTTC C 31 

(2) INFORMATION FOR SEQ ID NO: 319: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 29 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
CAGTGGATCC CGACAAAGGT GAGACTGAG 
(2) INFORMATION FOR' SEQ ID NO: 320: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
ACGTAAGCTT ATTTCTTAAT TCAAGTGTTT TCTCTG 
(2) INFORMATION FOR SEQ ID NO: 321: 

(i). SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
GACTGGATCC AAATCAATTG GTAGCACAAG ATCC 
(2) INFORMATION FOR SEQ ID NO: 322: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
CAGTGTCGAC ATTAGGAGCC ACTGGTCTC 
(2) INFORMATION FOR SEQ ID NO: 323: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
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CAGTGGATCC CAAACAGTCA GCTTCAGGAA C 31 ' 

(2) INFORMATION FOR SEQ ID NO: 324: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 324: 
GACTCTGCAG TTTAATCTTG TCCCAGGTGG 30 
(2) INFORMATION FOR SEQ ID NO: 325: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 325: 
GACTGGATCC ATTCGATGAT GCGGATGAAA AG 32 
(2) INFORMATION FOR SEQ ID NO: 326: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 326: 
GACTAAGCTT CATTTGTCTT TGGGTATTTC GCA 33 
(2) INFORMATION FOR SEQ ID NO: 327: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 327: 
CAGTGGATCC GGAGAGTCGA TCAAAAGTAG 30 
(2) INFORMATION FOR SEQ ID NO: 328: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 328 
GTCACTGCAG TTGCTCGTCT CGAGGTTC 
(2) INFORMATION FOR SEQ ID NO: 329: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 329 
CAGTGGATCC ATGGACAACA GGAAACTGGG AC 
(2) INFORMATION FOR SEQ ID NO: 330: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 330 
CAGTAAGCTT ATTAGCTTCT GTACCTGTGT TTG 
(2) INFORMATION FOR SEQ ID NO: 331: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 331 
GACTGGATCC CGATGGGCTC AATCCAACCC CAGGTCAAGT C 
(2) INFORMATION FOR SEQ ID NO: 332: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 332 
GACTCTGCAG CATAGCTTTA TCCTCTGACA TCATCGTATC 
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(2) INFORMATION FOR SEQ ID NO: 333: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 333: 
GACTGGATCC TTCCAATCAA AAACAGGCAG ATGG 34 
(2) INFORMATION FOR SEQ ID NO: 334: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 334: 
GACTAAGCTT GAGTCCCATA GTCCAAGGCA 30 
(2) INFORMATION FOR SEQ ID NO: 335: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 335: 
AGTCGGATCC TATCACAGGA TCGAACGGTA AGACAACC 38 
(2) INFORMATION FOR SEQ ID NO: 336: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 336: 
ACTGGTCGAC TTCTTTTAAC TCCGCTACTG TGTC 34 
(2) INFORMATION FOR SEQ ID NO: 337: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 337: 
CAGTGGATCC AAGTTCATCG AAGATGGTTG GGAAGTCC 38 
(2) INFORMATION FOR SEQ ID NO: 338: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 338: 
GATCGTCGAC CCGCTCCCAC ATGCTCAACC TT 32 
(2) INFORMATION FOR SEQ ID NO: 339: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 339: 
TGACGGATCC ATCGCTAGCT AGTGAAATGC AAGAAAG 37 
(2) INFORMATION FOR SEQ ID NO: 340: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 340: 
TGACAAGCTT ATTCGTTTTT GAACTAGTTG CTTTCGT 37 
(2) INFORMATION FOR SEQ ID NO: 341: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 341: 
GACTGGATCC GCACCAGATG GGGCACAAGG TTCAGGG 37 
(2) INFORMATION FOR SEQ ID NO: 342: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 342: 
TGACAAGCTT AACTTGTAAC GAACAGTTCA ATCTG 35 
(2) INFORMATION FOR SEQ ID NO: 343: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 343: 
GACTAGATCT TTTTAACCCA ACTGTTGGTA CTTTCC 36 
(2) INFORMATION FOR SEQ ID NO: 344: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 344: 
TGACAAGCTT GTTAGGTGTT ACATTTTGAC CGTC 34 
(2) INFORMATION FOR SEQ ID NO : 345: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 345: 
ACTGAGATCT TTTTAACCCA ACTGTTGGTA CTTTC 35 
(2) INFORMATION FOR SEQ ID NO: 346: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 346: 
GACTAAGCTT TCTACGATAA CGATCATTTT CTTTACC 37 
(2) INFORMATION FOR SEQ ID NO: 347: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 347: 
GACTGTCGAC TCGTAGATAT TTAAGTCTAA GTGAAGCG 38 
(2) INFORMATION FOR SEQ ID NO: 348: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
?(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 348: 
AGTCAAGCTT GTTAGGTGTT ACATTTTGCA AGTC 34 
(2) INFORMATION FOR SEQ ID NO: 349: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 349: 
GACTGGATCC CTTTGGTTTT GAAGGAAGTA AG 32 
(2) INFORMATION FOR SEQ ID NO: 350: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 350: 

TGACCTGCAG ACGATTTTTG AAAAATGGAG GTGTATC 37 

(2) INFORMATION FOR SEQ ID NO: 351: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 31 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 351 
CAGTGGATCC CTACTACCTC TCGAGAGAAA G 
(2) INFORMATION FOR SEQ ID NO: 352: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 352 



ACTGAAGCTT TTCGCTTTTT ACTCGTTTGA CA 



(2) INFORMATION FOR SEQ ID NO: 353: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 




( C ) STRANDEDNES S : doubl e 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 353 



CAGTGGATCC TAAGGTCAAA AGTCAGACCG CTAAGAAAGT GC 
(2) INFORMATION FOR SEQ ID NO: 354: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 354 
CAGTAAGCTT TAGGGTATCC AAATACTGGT TGTTGATG 
(2) INFORMATION FOR SEQ ID NO: 355: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 355 
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TGACAGATCT TGACGGGTCT CAGGATCAGA CTCAGG 
(2) INFORMATION FOR SEQ ID NO : 356: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 356 
TGACAAGCTT CAAAGACATC CACCTCTTGA CCTTTG 
(2) INFORMATION FOR SEQ ID NO: 357: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 357 
GACTGGATCC TAGAGGCTTT GCCAAATGGT GGGAAGGG 
(2) INFORMATION FOR SEQ ID NO : 358: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 358 
GTCAGTCGAC TTGTTGTAAC ACTTTTCGAG GTTTGGTACC 
(2) INFORMATION FOR SEQ ID NO : 359: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 359 
CAGTGGATCC TCAAAAAGAG AAGGAAAACT TGG 
(2) INFORMATION FOR SEQ ID NO: 360: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 360: 
CAGTCTGCAG TTTCTTCAAC AAACCTTGTT CTTG 34 
(2) INFORMATION FOR SEQ ID NO: 361: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 361: 
CAGTGGATCC ACGTTCTATT GAGGACCACT T 31 
(2) INFORMATION FOR SEQ ID NO: 362: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 362: 
CAGTAAGCTT TTCCTTCTCA GTCAATTCTT TTCC 34 
(2) INFORMATION FOR SEQ ID NO : 363: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 363: 
GACTGGATCC CGCTCAAAAT ACCAGAGGTG TTCAG 35 
(2) INFORMATION FOR SEQ ID NO: 364: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 364: 
GACTAAGCTT AGTACCATGG GTGTGACAGG TTTGAA 
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(2) INFORMATION FOR -SEQ ID NO: 3 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE:' nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQID NO: 365 
CTGAGGATCC AATTGTACAA TTAGAAAAAG ATAGC 
(2) INFORMATION FOR SEQ ID NO: 366: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 366 
TGACAAGCTT GCGTTGACTA GGTTCTGCAA TGCC 
(2) INFORMATION FOR SEQ ID NO : 3 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 367 
GACTGGATCC TCTGACCAAG CAAAAAGAAG CAGTCAATGA 
(2) INFORMATION FOR SEQ ID NO: 368: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 68 
TCAGCAGCTG ATCATTGACT TTACGATTTG CTCC 
(2) INFORMATION FOR SEQ ID NO: 369: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 369 
GACTGGATCC GTCCGGCTCT GTCCAGTCCA CTTTTTCAGC G 
(2) INFORMATION FOR SEQ ID NO: 370: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 370 
TCAGAAGCTT ATTTTTTGTT TCCTTAATGC GTT 
(2) INFORMATION FOR SEQ ID NO: 371: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 371 
GACTGGATCC GGGACAAATT CAAAAAAATA GGCAAGAGG 
(2) INFORMATION FOR SEQ ID NO: 372: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 372 
GTCAAAGCTT TGGCTCTTTG ATTGCCAACA ACTG 
(2) INFORMATION FOR SEQ ID NO: 373: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 373 
GACTGGATCC TCGCTACCAG CAACAAAGCG AGCAAAAGG 
(2) INFORMATION FOR SEQ ID NO: 374: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 374: 
GACTAAGCTT ACTTTTTTCT TTTTCCACAC GA 32 
(2) INFORMATION FOR SEQ ID NO: 375: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 375: 
CAGTGGATCC GAACCGACAA GTCGCCCACT ATCAAGACT 39 
(2) INFORMATION FOR SEQ ID NO: 376: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 376: 
CTGAAAGCTT TGAATTCTCT TTCTTTTCAG GCT 33 
(2) INFORMATION FOR SEQ ID NO: 377: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 377: 
TCGAGGATCC GGTTGTCGGC TGGCAATATA TCCCGT 36 
(2) INFORMATION FOR SEQ ID NO: 378: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 378: 
CAGTAAGCTT CCGAACCCAT TCGCCATTAT AGTTGAC 37 
(2) INFORMATION FOR SEQ ID NO : 379: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 379: 
AGTCGGATCC GGCCAAATCA GAATGGGTAG AAGAC 35 
(2) INFORMATION FOR SEQ ID NO: 380: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 380: 
TGACCTGCAG CTTCTCATTG ATTTTCATCA TCAC 34 
(2) INFORMATION FOR SEQ ID NO: 381: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 381: 
GACTGGATCC ATTTGCAGAT GATTCTGAAG GATGG 35 
(2) INFORMATION FOR SEQ ID NO: 382: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 382: 

TCAGCTGCAG CTTAACCCAT TCACCATTCT AGTTTAAG 38 

(2) INFORMATION FOR SEQ ID NO: 383: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 37 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 383: 
GACTGGATCC TGTCGCTGCA AATGAAACTG AAGTAGC 37 
(2) INFORMATION FOR SEQ ID NO: 384: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 384: 
GACTAAGCTT ATACCAAACG CTGACATCTA CGCG 34 
(2) INFORMATION FOR SEQ ID NO: 385: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 385: 
AGTCAGATCT TACGTCTCAG CCTACTTTTG TAAGAGC 37 
(2) INFORMATION FOR SEQ ID NO: 386: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 386: 
GACTAAGCTT AACCCATTCA CCATTGGCAT TGAC 34 
(2) INFORMATION FOR SEQ ID NO: 387: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 387: 



CAGTGGATCC TGGACAGGTG AAAGGTCATG CTACATTTGT G 
(2) INFORMATION FOR SEQ ID NO: 388: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 388: 
GACTAAGCTT CAACCATTGA GACCTTGCAA CAC 
(2) INFORMATION FOR SEQ ID NO: 389: 

(i) SEQUENCE CHARACTERISTICS: ' 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 389: 
GTCAGGATCC GATTGCTCCT TTGAAGGATT TGAGAGAAAC C 
(2) INFORMATION FOR SEQ ID NO: 390: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 390: 
GACTAAGCTT CGATCAAAGA TAAGATAAAT ATATATAAAG T 
(2) INFORMATION FOR SEQ ID NO: 391: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 391: 
GACTGGATCC TAGGTCATAT GGGACTTTTT TTCTACAACA AAATAGG 
(2) INFORMATION FOR SEQ ID NO: 392: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 392: 
TGACAAGCTT ATCTATCAGC TCATTTAATC GTTTTTG 37 
(2) INFORMATION FOR SEQ ID NO: 393: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 393: 
CTGAGGATCC CAACGTTGAG AATTATTTGC GAATG 35 
(2) INFORMATION FOR SEQ ID NO: 394: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 394: 
TGACAAGCTT GAGTCTACAA AAGTAATGTA C 31 
(2) INFORMATION FOR SEQ ID NO: 395: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 
{C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 395: 
GTCAGGATCC CTACTATCAA TCAAGTTCTT CAGCC 35 
(2) INFORMATION FOR SEQ ID NO: 396: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 396: 
TGACAAGCTT GACTGAGGCT TGGACCAGAT TGAAAAG 



37 
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(2) INFORMATION FOR SEQ ID NO: 397: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 397: 
GACTGGATCC GACAAAAACA TTAAAACGTC CTGAGG 36 
(2) INFORMATION FOR SEQ ID NO: 398: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 398: 
GACTAAGCTT AGCACGAACT GTGACGCTGG TTCC 34 
(2) INFORMATION FOR SEQ ID NO: 399: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 399: 
GACTGGATCC TTCTCAGGAG ACCTTTAAAA ATATC 35 
(2) INFORMATION FOR SEQ ID NO: 400: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 400: 
GACTAAGCTT GTTGGCCATC TTGTACATAC C 31 
(2) INFORMATION FOR SEQ ID NO: 401: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 401 
GACTGGATCC AGTAAATGCG CAATCAAATT C 
(2) INFORMATION FOR SEQ ID NO : 402: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 402 
AGTCCTGCAG GTATTTAGCC CAATAATCTA TAAAGCT 
(2) INFORMATION FOR SEQ ID NO: 403: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 403 
CAGTGGATCC TTACCGCGTT CATCAAGATG TC 
(2) INFORMATION FOR SEQ ID NO : 404: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 404 
GACTAAGCTT GCCAGATGTT GAAAAGAGAG TG 
(2) INFORMATION FOR SEQ ID NO: 405: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 405 
GACTGGATCC GTGGATGGGC TTTAACTATC TTCGTATTCG 
(2) INFORMATION FOR SEQ ID NO: 406: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 406: 
AGTCAAGCTT GCTAGTCTTC ACTTTCCCTT TCC 33 
(2) INFORMATION FOR SEQ ID NO: 407: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 407: 
GACTGTCGAC ACTAAACCAG CATCGTTCGC AGGA 34 
(2) INFORMATION FOR SEQ ID NO: 408: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 408: 
CTGACTGCAG CTTCTTGAAG AAATAATGAT TGTGG 35 
(2) INFORMATION FOR SEQ ID NO: 409: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 409: 
CAGTGGATCC TGACTACCTT GAAATCCCAC TT 32 
(2) INFORMATION FOR SEQ ID NO: 410: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



261 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 410: 
CAGTAAGCTT TTTTTTAAGG TTGTAGAATG ATTTCAATC 39 
(2) INFORMATION FOR SEQ ID NO: 411: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 411: 
CAGTGTCGAC TCGTATCTTT TTTTGGAGCA ATGTT 35 
(2) INFORMATION FOR SEQ ID NO: 412: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 412: 
GACTAAGCTT AAATGTTCCG ATACGGGTGA TTG 33 
(2) INFORMATION FOR SEQ ID NO: 413: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 413: 
CAGTGGATCC GGACTCTCTC AAAGATGTGA AAG 33 
(2) INFORMATION FOR SEQ ID NO: 414: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) . TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 414: 

GACTAAGCTT CTTGAGTTTG TCAAGGATTG CTTT 34 

(2) INFORMATION FOR SEQ ID NO: 415: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 38 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 415: 
CAGTGGATCC CAAGAAATCC TATCATCTCT TCCAGAAG 38 
(2) INFORMATION FOR SEQ ID NO: 416: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 416: 
GACTAAGCTT TTCAGAACTA AAAGCCGCAG CTT 33 
(2) INFORMATION FOR SEQ ID NO: 417: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 417: 
GACTGGATCC ACGAAATGCA GGGCAGACAG 3 0 

(2) INFORMATION FOR SEQ ID NO: 418: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 418: 
CAGTAAGCTT ATCAACATAA TCTAGTAAAT AAGCGT 36 
(2) INFORMATION FOR SEQ ID NO: 419: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 419: 
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CAGTGGATCC TGTATAGTTT TTAGCGCTTG TTCTTC 36 
(2) INFORMATION FOR SEQ ID NO: 420: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 420: 
GTCAAAGCTT TGATAGAGTG TCATAATCTT CTTTAG 36 
(2) INFORMATION FOR SEQ ID NO: 421: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 421: 
GACTGGATCC GTGTGTCGAG CATATTCTGA AG 32 
(2) INFORMATION FOR SEQ ID NO: 422: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
{ D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 422: 
CAGTAAGCTT ACTTTTACCA TTTCTTTGTT CTGCATC 37 
(2) INFORMATION FOR SEQ ID NO: 423: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 423: 
GACTGTCGAC GTGTTTGGAT AGCATTCAGA ATCAGACG 38 
(2) INFORMATION FOR SEQ ID NO: 424: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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( D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 424: 
CAGTAAGCTT CGGAAGTAAA GACAATTTTT CC 
(2) INFORMATION FOR SEQ ID NO: 425: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 425: 
CAGTGGATCC GTGCCTAGAT AGTATTATTA CTCAAAC 
(2) INFORMATION FOR SEQ ID NO: 426: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 426: 
GACTAAGCTT TTTGCTTATT TCTCTCAATT TTTC 
(2) INFORMATION FOR SEQ ID NO: 427: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 427: 
CAGTGGATCC CATTCAGAAG CAGACCTATC AAAATC 
(2) INFORMATION FOR SEQ ID NO: 428: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 428: 
ACTGAAGCTT ATGTAATTTT TTAGATTTTT CAATATTTTT CAG 
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(2) INFORMATION FOR SEQ ID NO: 429: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 429 
AGTCGGATCC TAAGGCTGAT AATCGTGTTC AAATG 
(2) INFORMATION FOR SEQ ID NO: 430: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 430 
GACTAAGCTT AAAATTAGAT AGACGTTGAG T 
(2) INFORMATION FOR SEQ ID NO: 431: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 431 
AGTCGGATCC CTGTGGCAAT CAGTCAGCTG CTTCC 
(2) INFORMATION FOR SEQ ID NO: 432: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 432 
GACTGTCGAC TTTAATCTTG TCCCAGGTGG TTAATTTGCC 
(2) INFORMATION FOR SEQ ID NO: 433: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 433 
ACTGGTCGAC TTGTCAACAA CAACATGCTA CTTCTGAG 
(2) INFORMATION FOR SEQ ID NO: 434: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs . 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 434 
GACTCTGCAG AAGTTTAACC CACTTATCAT TATCC 
(2) INFORMATION FOR SEQ ID NO: 435: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 435 
ACTGGGATCC TTGTTCAGGC AAGTCCGTGA CTAGTGAAC 
(2) INFORMATION FOR SEQ ID NO : 436: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 436 
GACTAAGCTT GGCTAATTCC TTCAAAGTTT GCA 
(2) INFORMATION FOR SEQ ID NO: 437: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 437 
AGTCGGATCC CTCGCAAATT GAAAAGGCGG CAGTTAGCC 
(2) INFORMATION FOR SEQ ID NO: 438: 
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(i) SEQUENCE CHARACTERISTICS: 1 

(A) LENGTH: 35 base pairs ^ 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 438: 
GACTAAGCTT GTAAATAAGC GTACCTTTTT CTTCC 35 
(2) INFORMATION FOR SEQ ID NO: 439: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 439: 
TCAGGGATCC TTGTCAGTCA GGTTCTAATG GTTCTCAG 38 
(2) INFORMATION FOR SEQ ID NO: 440: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 440: 
AGTCAAGCTT GGCATTGGCG TCGCCGTCCT TC 32 
(2) INFORMATION FOR SEQ ID NO: 441: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 441: 
GACTGGATCC GGAAACTTCA CAGGATTTTA AAGAGAAG 38 
(2) INFORMATION FOR SEQ ID NO: 442: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 442 
GACTGTCGAC AATCAATCCT TCTTCTGCAC TTCT 
(2) INFORMATION FOR SEQ ID NO: 443: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 443 
CAGTGGATCC TGTGGTCGAA GTTGAGACTC CTCAATC 
(2) INFORMATION FOR SEQ ID NO: 444: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear- 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 444 
GACTAAGCTT TTCTTCAAAT TTATTATCAG C 
(2) INFORMATION FOR SEQ ID NO: 445: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 445 
AGTCGGATCC AACACCTGTA TATAAAGTTA CAGCAATCG 
(2) INFORMATION FOR SEQ ID NO: 446: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 446 

GACTGTCGAC TACTTGACCG AATGCGTCGA ATGTACG 

(2) INFORMATION FOR SEQ ID NO: 447: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 34 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 447: 
CTGAGGATCC ATTAGACAGA TTAATTGAAA TCGG 34 
(2) INFORMATION FOR SEQ ID NO: 448: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 448: 
GACTGTCGAC TTTAAAGATT GAAGTTTTAA AGCT 34 
(2) INFORMATION FOR SEQ ID NO: 449: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 449: 
TGACGGATCC TAAGACAGAT GAACGGAGCA AGGTG 35 
(2) INFORMATION FOR SEQ ID NO: 450: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 450: 
CTGAAAGCTT TAAGGCTTCC TCAATGAGTT TGTCT 35 
(2) INFORMATION FOR SEQ ID NO: 451: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 451: 



I 
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GACTGGATCC CTGTGAGAAT CAAGCTACAC CCA 
(2) INFORMATION FOR SEQ ID NO: 452: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 452 
CTGAAAGCTT TTGTAACTGA GATTGATCTG GGAG 



(2) INFORMATION FOR SEQ ID NO: 453: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:453: 

Leu Ala Ser Gly Ala Cys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 454: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 454: 

Leu Pro Xaa Thr Gly Xaa 
1 5 



